1 Introduction

Processes involving third generation quarks play a key role in collider phenomenology. The
reason is twofold. On the one hand, bottom and top quarks have very distinctive signatures
with respect to light quarks, which make flavor identification easy. Experimentally,
bottom quarks can be efficiently identified, for instance, through displaced vertices in
the detector, while top quarks can be detected indirectly via their decay to a bottom quark
and a $W$ boson. On the other hand, they deserve special theoretical attention. Being
the heaviest among all quarks, they interact most strongly with the electroweak
symmetry breaking sector in the standard model (SM) and in many extensions thereof.
A light Higgs boson, for example, predominantly decays into bottom-antibottom quark
pairs, providing the main decay channel where the Higgs associated production mechanisms
($WH$, $ZH$, $t\bar{t}H$) can be searched for. In addition, in models like supersymmetry,
where the coupling of the Higgs to bottom quarks can be comparable to that of the top quark
in the SM, associated production with bottom quarks can provide the largest production
mode. Many of the searches for the Higgs and for new physics in general start from final
state signatures which include bottom quarks. The need for accurate theoretical predictions
for such processes, both signal and backgrounds, is therefore very well motivated.

Calculations of high-energy processes involving the production of bottom quarks (and
heavy quarks in general) are typically performed in two different ways. One option (which,
for simplicity, will be referred to as the "massive" or 4-flavor (4F) scheme in the following) is
the most straightforward from the conceptual point of view: bottom quarks are significantly
heavier than the proton and can therefore only be created in pairs (or singly in association
with a top quark, in the case of weak interactions) in high-$Q^2$ interactions. In this approach,
the heavy quark does not contribute to the proton wave-function, and can only be generated
as a massive final state, where the heavy quark mass can act as an infrared cut-off for the
more inclusive observables. More technically, for this approach to be reliable, one assumes
the heavy quark mass to be of the same order as the other hard scales involved in the
process. At the same time, $b$-parton distribution functions are set to zero. In practice, the
massive scheme amounts to employing an effective theory with $n_l$ light quarks, where the heavy
quarks are decoupled and do not enter the computation of the running coupling constant
or the evolution of the PDFs. There are many (differential) calculations available
at NLO in QCD performed in this scheme relevant for hadron collider phenomenology,
including, among the most important ones, $pp \to b\bar{b} + X$ [1, 2], $pp \to 4b + X$ [3],
$pp \to t\bar{t}b\bar{b} + X$ [4, 5], $pp \to tbj + X$ [6], $pp \to H^{\pm}tb + X$ [7],
$pp \to \phi b\bar{b} + X$ with $\phi = H, A$ [8-11],
and $pp \to Vb\bar{b} + X$ with $V = W, Z$ [12-18]. However, no prediction at NNLO in QCD for
any process of relevance at the LHC is available in this scheme to date.

Alternatively, one may face a situation where the typical scale $Q$ of the process, such
as the $p_T$ of the bottom quark itself or the mass of a weak boson in the final state, is much
higher than the mass of the heavy quark, and logarithms of the type $\log(Q^2/m^2)$ (which can be
of initial- or final-state nature) appear that might spoil the convergence of a fixed-order
perturbative expansion. In this case, it is natural to consider a scheme where heavy quark
masses are treated as small parameters, power corrections of the type $(m^2/Q^2)^n$ are
pushed to higher orders, and towers of $\log^n(Q^2/m^2)$ appearing at all orders are explicitly resummed
via the usual Altarelli-Parisi evolution equations. We dub this class of approaches and their
systematic improvements "massless" or 5-flavor (5F) schemes.^1 In so doing, initial-state
large logarithms are resummed into a $b$ distribution function and final-state ones into
perturbative fragmentation functions. In fact, initial- and final-state logarithms have a very
different status, the latter being not only better understood but also possibly avoided by
introducing more inclusive observables than those related to $B$-mesons, such as $b$-jets [19]
or even $b\bar{b}$-jets [20], which are not affected by such large logs in the first place. The case
of initial-state $b$'s, on which we focus in this work, is different. Many (differential)
NLO calculations relevant at the LHC are available in this scheme too: $pp \to tj + X$ [21-26],
$pp \to tW + X$ [24, 27], $pp \to H^{\pm}t + X$ [28, 29], $pp \to \phi(b\bar{b}) + X$ [30],
$pp \to \phi b(\bar{b}) + X$ with $\phi = H, A$ [31], $pp \to Vb + X$ [32, 33], and
$pp \to Vbj + X$ [34, 35] with $V = W, Z$. In addition, $pp \to \phi(b\bar{b}) + X$ with
$\phi = H, A$ [36] and $pp \to Z(b\bar{b}) + X$ [37] are also available at NNLO.

^1 We draw the reader's attention to the fact that the naming "massive" and "massless" is conventional
and does not imply anything about whether power-like bottom mass effects are included in actual
calculations. There are many accurate predictions performed in the massive scheme which do not
include mass effects because they are small for the observables of interest. On the other hand, in massless
"improved" schemes mass effects are normally taken into account when higher-order terms are included.

Both schemes present advantages and disadvantages and are typically adopted in complementary
regimes, depending on the size of the heavy quark masses relative to
the scale $Q$ of the process. It turns out that in the massless schemes calculations are highly
simplified: in the Born configuration both the number of external legs and the number of scales in
the process are reduced. Mass and "spectator heavy quark" effects appear at higher
orders and can be systematically added. In addition, as mentioned above, potentially large
logarithms of the ratio $Q^2/m^2$ arising from collinear splitting of the initial heavy quarks
and gluons are consistently resummed in the heavy quark PDF. In the massive scheme,
instead, the computation is more complicated due to the presence of massive final states
and higher multiplicity; however, the full kinematics of the heavy quarks is taken into
account already at leading order and can be accurately studied via a next-to-leading
order computation. In the latter case, the implementation in parton shower codes is also
straightforward, as no complications due to the inherent arbitrariness in the description
of massive effects arise. The downside is that possibly large logarithms developing in the
initial (and also the final) state are not resummed and could lead to a poorly-behaved
perturbative expansion. In this work we will argue that in fact, at hadron colliders, this is
hardly the case for a large class of processes and observables, and that such resummation effects are
quite mild in general and often smaller than other approximations inherent in fixed order
calculations.

To all orders in perturbation theory the two schemes can be defined so as to be
exactly identical. On the other hand, the way of ordering the perturbative expansion
is different, and at any finite order the results might not necessarily match. This is the
case for the many 5F improved schemes available, which typically differ order-by-order in
perturbation theory due to the different possible choices in including subleading terms.
For some processes the difference between leading order calculations performed in the two
schemes may be very significant, and in extreme cases has led to (only apparently)
puzzling predictions differing by up to an order of magnitude, as in the case of $b$-initiated
Higgs production [9, 30, 31, 37-39].

In this work we critically reconsider the motivations for using 5F or massless improved
schemes in which potentially large logarithms are resummed. Our aim is not to analyse in
detail the technical differences between the various approaches to 5F improved schemes,
but to assess the relevance of the large logarithms in the first place for processes of interest
to LHC phenomenology, and whether such schemes need to be employed at all. Our argument is
based on two main points. The first is a simple yet quantitative study of the importance of the
resummation effects in $b$ distribution functions with respect to having only explicit leading
and subleading logs at fixed order. The second is a systematic assessment of the size of such
logs in 4F calculations. To this aim, we perform a thorough analysis of their origin and role
in various processes initiated by a single bottom. We analyse in detail some representative
processes that can be treated analytically and cover a broad spectrum of possibilities. In
the single heavy quark category fall both lepton-hadron (e.g. DIS) and hadron-hadron
collider processes. The first category is mostly interesting for pedagogical reasons, and
because it is the place where heavy flavor schemes have been most extensively discussed in the
literature. It might also be interesting in view of the LHeC electron-proton collider [40]. Among
the hadron-hadron processes we consider both $2 \to 2$ processes, like associated bottom and
vector boson production, and $2 \to 3$ processes, like single top production.

As a result, we will argue that at the LHC the logs that are resummed in $b$ distribution
functions, apart from extreme cases which we will clearly identify, are not very large,
because the possibly large ratios $Q^2/m_b^2$ are always accompanied by universal phase space
suppression factors at all orders in perturbation theory. Such logs, therefore, do not spoil
the convergence of the perturbative series in the 4F scheme, whose results are in general
well-behaved. 5F scheme calculations, on the other hand, typically display smaller
scale uncertainties, leading to more accurate predictions for very inclusive observables, such
as total rates. However, for differential distributions and more exclusive observables their
predictions might be less accurate, technically more involved and in general less suitable
for comparisons between theory and experiment than those obtained via NLO computations
in the 4F scheme interfaced to parton showers, employing the MC@NLO [41] or
POWHEG [42] methods.

The paper is organized as follows. In Section 2 we provide a brief overview of the
various possibilities available in the literature for implementing 5F (improved) schemes.
The section is meant for a reader who is not familiar with the subject and would like to learn
the basic concepts and techniques, as well as where to find more details in selected references.
In Section 3 we show that, apart from extreme kinematical situations, the resummation
effects in the $b$ distribution functions are mild compared to the contributions from the
explicit leading and subleading logs present at fixed order in 4F scheme calculations. In
Section 4 the DIS case is discussed. This allows us to set the stage and illustrate our analysis
approach in a simple and very instructive case, where the logarithms $\log(Q^2/m^2)$ are clearly
identified and their relevance can be easily studied as a function of the kinematical setup. In
Section 5 we finally address processes at the LHC and show that initial-state collinear logarithms
are typically not very large. The last section presents a summary together with our conclusions.

2 Heavy quark schemes: a short review 

Processes involving heavy quarks often provide a good example of multi-scale processes. 
First, let us define a heavy quark to be one whose mass $m$ is large enough that the effective
coupling at the scale of the heavy quark mass, $\alpha_s(m^2)$, is in the perturbative region. With
this definition, the bottom and top quarks are definitely heavy quarks. The charm quark 
mass, on the other hand, is rather close to the boundary region and QCD perturbative 
corrections at the scale of the charm mass are generally large, and can only be taken as an 
indication. 

Whenever a cross section is characterised by two different scales, here the hard scale
$Q$ and the mass of a heavy quark $m$, perturbative calculations typically display both logarithms
and powers of the ratio $m^2/Q^2$. The former contributions may spoil the accuracy
of the calculation if $Q \gg m$. It is easy to trace back the dynamical origin of such logs to
the process of gluon splitting into a heavy quark-antiquark pair. Such splitting can take
place either in the initial or in the final state. As already mentioned, final-state logs can
be resummed into fragmentation functions or cancelled by defining more inclusive observables
such as heavy-quark jets [19]. In the following, we focus on the logarithms arising
from initial-state gluon splitting. By defining a heavy quark parton density, one
automatically resums the logarithms to all orders in perturbative QCD via the
Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (DGLAP) evolution equations: when computing the heavy
quark parton densities from the 1-loop DGLAP evolution equations, all possible multiple
collinear gluon emissions from the initial state are in effect summed up to all orders in $\alpha_s$ in
the leading-log approximation. However, this approach is only adequate at large $Q$, where
the mass of the heavy quark is small with respect to the typical scale of the process. At
smaller $Q$, powers of $\log(Q^2/m^2)$ do not spoil perturbation theory, and power corrections
in $m^2/Q^2$ need to be consistently included.

Theoretical predictions in QCD have been performed according to a variety of schemes
for dealing with heavy quark masses. We restrict our discussion to two (classes of) schemes
for dealing with the $b$ distribution, which we denote by 5F or massless and 4F or massive
schemes. Technically, our definition of the 4F scheme corresponds to the so-called Fixed
Flavor Number (FFN) scheme with $n_f = 4$. In this scheme the bottom quark appears
only among the final state particles, and is not associated with a PDF. Calculations in the
4F scheme are meant to provide reliable results when the scale of the process is not much 
larger than m. At any finite order in a perturbative calculation, the FFN result is expected 
to break down as the scale becomes large compared to m. 

Natural extensions of the FFN scheme are the so-called Variable Flavor Number (VFN)
schemes, which consist of a sequence of $n_f$-flavor FFN schemes, each in its region of validity,
consistently matched at the transition thresholds. There are, however, several ways of
implementing a VFN scheme. The simplest implementation is the zero-mass
(ZM) approximation, where all quarks are treated as massless. Heavy quarks are absent
at scales $\mu^2 < m^2$, and they are radiatively generated at the transition points $\mu^2 = m^2$ by
the subprocess $g \to Q\bar{Q}$, but apart from this they are treated as massless partons. The ZM-VFN
scheme is therefore a combination of massless $\overline{\rm MS}$ schemes characterized by different
numbers of light flavors $n_l$. Basically, the only mass effects are due to the change in the number
of flavors in the QCD $\beta$ function and in the anomalous dimensions as one crosses the heavy
quark thresholds. The coefficient functions are calculated under the assumption that all
active partons are massless, with the associated singularities subtracted in the $\overline{\rm MS}$ scheme.
This scheme becomes inaccurate for scales close to the thresholds. This is due not only
to the fact that $O(m^2/Q^2)$ terms are neglected in the coefficient functions, but
also to the approximate treatment of the phase space and the arbitrariness in the definition
of the scaling variable, i.e. the fraction of energy in the collision taken by the heavy quark.
Due to the absence of mass effects, the ZM formalism is not a good approximation in the
region near the physical thresholds.
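The flavor-number switching at the thresholds can be illustrated with the one-loop running coupling. The following is a minimal sketch, not the implementation used in any PDF fit: it assumes leading-order running with $\beta_0 = 11 - \frac{2}{3} n_f$, a reference value $\alpha_s(M_Z) = 0.118$, a charm threshold at 1.4 GeV and a bottom threshold at 4.75 GeV, and continuous matching at $\mu = m$. All function names and the numerical inputs other than the bottom mass are illustrative assumptions.

```python
import math

# Illustrative inputs (assumptions made for this sketch).
ALPHA_S_MZ = 0.118   # reference coupling at the Z mass
M_Z = 91.1876        # GeV
M_C = 1.4            # GeV, assumed charm threshold
M_B = 4.75           # GeV, bottom threshold


def beta0(nf: int) -> float:
    """One-loop beta-function coefficient, beta0 = 11 - 2/3 nf."""
    return 11.0 - 2.0 * nf / 3.0


def evolve_lo(alpha_start: float, mu_start: float, mu_end: float, nf: int) -> float:
    """One-loop running between two scales with a fixed number of flavors."""
    log_ratio = math.log(mu_end**2 / mu_start**2)
    return alpha_start / (1.0 + beta0(nf) * alpha_start * log_ratio / (4.0 * math.pi))


def alpha_s_vfn(mu: float) -> float:
    """LO coupling with nf = 3, 4, 5, matched continuously at m_c and m_b.

    The top threshold is ignored, so the sketch is meant for mu below m_t
    and above roughly 1 GeV.
    """
    if mu >= M_B:
        return evolve_lo(ALPHA_S_MZ, M_Z, mu, nf=5)
    alpha_mb = evolve_lo(ALPHA_S_MZ, M_Z, M_B, nf=5)
    if mu >= M_C:
        return evolve_lo(alpha_mb, M_B, mu, nf=4)
    alpha_mc = evolve_lo(alpha_mb, M_B, M_C, nf=4)
    return evolve_lo(alpha_mc, M_C, mu, nf=3)


if __name__ == "__main__":
    for mu in (2.0, M_B, 20.0, M_Z):
        print(f"mu = {mu:8.3f} GeV   alpha_s = {alpha_s_vfn(mu):.4f}")
```

The coupling itself is continuous at each threshold, while its logarithmic slope changes because $\beta_0$ depends on the number of active flavors. In this toy setting that change is the only place where the heavy quark mass enters.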

In the following, with 5F we indicate a scheme where the collinear logarithms appearing
in each term of the FFN scheme computation are resummed to all orders in the heavy quark
PDF, and where powers of $m^2/Q^2$ are consistently included at higher orders. Such schemes
are called General-Mass VFN (GM-VFN) schemes, and all partons, including the bottom, are associated
with a parton distribution function above threshold. The GM-VFN scheme represents an
improvement over the ZM-VFN scheme. However, it also introduces new ambiguities with
respect to the shifting of higher-order terms into the lower-order expressions. In the GM-VFN
schemes the mass of the heavy quarks is taken into account in the partonic cross
sections, and the schemes are designed to interpolate between the FFN scheme, which
gives a correct description of the threshold region, and the ZM-VFN scheme, which accounts for
large energy logarithms. The first proposed matching technique for the inclusion of
mass-suppressed contributions, built upon the CWZ renormalization scheme [43], was developed
long ago; it is the so-called ACOT scheme [44]. It provides a mechanism to incorporate
the heavy quark masses in theoretical calculations both kinematically and dynamically.
It yields the complete quark mass dependence from the low to the high energy regime; for
$m \gg Q$ it ensures manifest decoupling, and in the limit $m \ll Q$ it reduces precisely to
the $\overline{\rm MS}$ scheme without any finite renormalization term. Several variants of this method
were subsequently proposed, such as S-ACOT [45] or ACOT-$\chi$ [46], which include some
improvements and/or simplifications with respect to the original ACOT scheme. A different
approach, the Thorne-Roberts [47] VFN scheme, or TR' [48] in its latest version, emphasizes
the correct threshold behaviour and includes higher-order terms in order to smoothen the
function at the transition points. Namely, the NLO computation includes $O(\alpha_s^2)$ terms,
which formally belong to the next-to-next-to-leading order and are introduced in order
to cure the discontinuities of the physical observables at the heavy quark mass threshold.
An alternative approach considers both massless and massive scheme calculations as power
expansions in the strong coupling constant, and replaces the coefficients of the expansion in
the former with their exact massive counterparts in the latter. A first step in this direction
was taken in Ref. [49]. More recently, another method, called FONLL, was introduced in
Ref. [50] in the context of hadroproduction of heavy quarks, and was recently applied to
deep-inelastic structure functions in Ref. [51]. Most of the up-to-date parton sets [52-56]
implement one of the above-mentioned matched schemes as a default scheme. On top of
that, each collaboration also provides FFNS parton sets with $n_f = 3, 4, 5$. Other PDF
fitting collaborations [57, 58] provide as default sets FFNS parton sets with $n_f = 3, 4, 5$.
At a fixed order $n$ in perturbation theory, the difference between these schemes amounts
to adding different $O(\alpha_s^{n+1})$ higher-order terms; the difference is therefore reduced as one
increases the perturbative order. A benchmark comparison is available in Ref. [59].

3 Impact of resummation 

In the previous section we have distinguished the 4F schemes, where $b$-PDFs are not present
and collinear logarithms of the mass of the bottom are not resummed, and the 5F schemes,
where instead this class of logarithms is resummed. The question we address in this section
is the following:

• What is the typical size of the effects of the resummation of initial state collinear logs
of the type $\log(Q^2/m_b^2)$ with respect to an approximation where only logs at a finite
order in perturbation theory are kept?

While a detailed answer clearly depends on the kinematic regime considered, it is interesting 
to explore the possibility that general features might be present at colliders in general and 
at the LHC in particular. Quite surprisingly, while this question has already been addressed in
the past in several specific contexts, to our knowledge it has never been answered for $b$-PDFs.

In order to assess the relevance of the resummation of logarithms of $Q^2/m_b^2$ performed
in the 5F scheme, one must compare the exact solution of the DGLAP equations with its
expansion in powers of $\alpha_s$ up to a given fixed order. This comparison can be easily performed
analytically at leading order. In this case the full $b$ parton density is a solution of
the evolution equation (in Mellin moment space)

where $g(N,\mu^2)$ is the Mellin-transformed gluon distribution, $\gamma_{ij}$ are the leading order
Altarelli-Parisi anomalous dimensions, and the initial condition is $b(N,m_b^2) = 0$. This
equation can be easily solved if the gluon scale dependence is neglected, an approximation
which is valid at this order if $N$ (i.e., $x$) is not too small. A straightforward calculation
yields

where $\beta_0 = 11 - \frac{2}{3} n_f$. This solution can be expanded in powers of $\alpha_s(m_b^2)$. Using the
binomial expansion

and standard properties of the $\Gamma$ function we find

Clearly, the resummation terms of order $\alpha_s^2$ and higher become relevant when the factorization
scale $\mu$, typically of the order of the hard scale $Q$ of the process, is much larger
than $m_b$. We note, however, that the coefficients $A_k(N)$ behave as $\log N$ at large $N$, due
to the large-$N$ behavior $|\gamma_{qq}| \sim \log N$, which originates from the soft pole in the Altarelli-Parisi
splitting function. We therefore expect the effects of resummation to be more important
as the momentum fraction $z$ gets close to 1, which corresponds to the large-$N$ region in
Mellin space.
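The comparison between the resummed solution and its fixed-order truncation can be explored numerically in a toy setting. The sketch below is only an illustration under stated assumptions. It takes the evolution equation in the form $db/d\log\mu^2 = \frac{\alpha_s}{2\pi}[\gamma_{qg}\, g + \gamma_{qq}\, b]$ with a scale-independent gluon moment, one-loop running with $n_f = 5$, and an illustrative value $\alpha_s(m_b^2) = 0.21$. The closed form coded in `b_resummed` was derived for this sketch under that assumed normalisation and is not quoted from the paper. Because the gluon evolution is neglected, the numbers are not meant to be compared with Fig. 1; only the agreement between the closed form and the direct integration, and the approach of the one-log term to the full result near threshold, are being checked.

```python
import math

CF, TR = 4.0 / 3.0, 0.5        # QCD colour factors
NF = 5                         # assumed number of flavours entering beta0
BETA0 = 11.0 - 2.0 * NF / 3.0
ALPHA_S_MB = 0.21              # assumed alpha_s(m_b^2), illustrative only


def gamma_qq(n: int) -> float:
    """Mellin moment of the LO P_qq splitting function (integer n >= 1)."""
    harmonic = sum(1.0 / k for k in range(1, n + 1))
    return CF * (1.5 + 1.0 / (n * (n + 1)) - 2.0 * harmonic)


def gamma_qg(n: int) -> float:
    """Mellin moment of the LO P_qg splitting function."""
    return TR * (n * n + n + 2) / (n * (n + 1) * (n + 2))


def alpha_s(t: float) -> float:
    """One-loop coupling at t = log(mu^2 / m_b^2)."""
    return ALPHA_S_MB / (1.0 + BETA0 * ALPHA_S_MB * t / (4.0 * math.pi))


def b_resummed(n: int, t: float, gluon: float = 1.0) -> float:
    """Closed-form solution for a frozen gluon moment (needs n >= 2)."""
    exponent = 2.0 * gamma_qq(n) / BETA0
    ratio = ALPHA_S_MB / alpha_s(t)
    return -gamma_qg(n) / gamma_qq(n) * gluon * (1.0 - ratio**exponent)


def b_first_order(n: int, t: float, gluon: float = 1.0) -> float:
    """First term of the expansion in alpha_s(m_b^2): one collinear log."""
    return ALPHA_S_MB / (2.0 * math.pi) * gamma_qg(n) * gluon * t


def b_numeric(n: int, t_end: float, gluon: float = 1.0, steps: int = 2000) -> float:
    """Runge-Kutta (RK4) integration of the evolution equation from b = 0."""
    def rhs(t: float, b: float) -> float:
        return alpha_s(t) / (2.0 * math.pi) * (gamma_qg(n) * gluon + gamma_qq(n) * b)

    h = t_end / steps
    t, b = 0.0, 0.0
    for _ in range(steps):
        k1 = rhs(t, b)
        k2 = rhs(t + 0.5 * h, b + 0.5 * h * k1)
        k3 = rhs(t + 0.5 * h, b + 0.5 * h * k2)
        k4 = rhs(t + h, b + h * k3)
        b += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
        t += h
    return b


if __name__ == "__main__":
    t = math.log(100.0**2 / 4.75**2)  # mu = 100 GeV, m_b = 4.75 GeV
    for n in (2, 5, 10):
        print(n, b_resummed(n, t), b_numeric(n, t), b_first_order(n, t))
```

Integer moments $N \geq 2$ are used because $\gamma_{qq}(1) = 0$, which makes the closed form singular there; a realistic study would also evolve the gluon and invert the Mellin transform.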



As we will see in the following, this simple derivation nicely reproduces the qualitative
behaviour of the evolution of the $b$-PDF at medium to large $x$ both at LO and NLO.
However, the approximation of no evolution for the gluon PDF is untenable at small $x$ and
therefore it cannot be used in this regime. In fact, as we will show in the following, even
the exact evolution at LO is not reliable at small $x$ and NLO accuracy is needed.

If no intrinsic bottom content of the proton is envisaged, then the bottom parton
density which appears in the 5F scheme is generated at threshold (by setting $b(x, m_b^2) = 0$
as a boundary condition) by gluon and light parton distributions. As a result the bottom
PDF is not an independent quantity: it depends on the four light flavour ($u, d, c, s$) and the
gluon ($g$) densities in the 4F scheme. Following the notation of Ref. [49], the bottom PDF
can be written as a function of the light flavor singlet and the gluon up to $O(\alpha_s^2)$ as

where by $\tilde{b}^{(p)}$ we indicate the approximated $b$ distribution that one obtains when truncating
the perturbative expansion of the bottom PDF evolution at $O(\alpha_s^p)$. The operator matrix
elements $A_{ij}$ have been computed in Mellin $N$-space in the asymptotic limit $Q^2 \gg m^2$
up to $O(\alpha_s^3)$ and in physical $x$-space up to $O(\alpha_s^2)$ in Refs. [49, 60-65]. In physical
$x$-space up to $O(\alpha_s^2)$, they read

where $\Sigma = \sum_i (q_i + \bar{q}_i)$, with the sum running over the light flavours, and
$\alpha_s = \alpha_s(\mu^2)$. The explicit coefficients, taken from Ref. [49],
are given in Appendix C. Note that in the above formula, the $(1)$ and $(2)$ superscripts indicate
the perturbative order for both the perturbative matching coefficient and the PDFs. At
LO, it is easy to check that $\tilde{b}^{(1)}$ is the first term in Eq. (3.4), namely



In order to assess the accuracy of the $O(\alpha_s)$ and $O(\alpha_s^2)$ approximations, we plot in
Fig. 1 the ratio $\tilde{b}(x_0,\mu)/b(x_0,\mu)$ as a function of $\mu$ between $m_b$ and
$10^{\ldots} \times m_b$, for several values of $x_0$. Deviations of these curves from 1 are an
indication of the size of the terms of $O(\alpha_s^2)$ ($O(\alpha_s^3)$) and higher which have been
resummed in the QCD evolution of the heavy quark PDFs.

We observe that at LO the higher order logarithms are important and that the $\tilde{b}^{(1)}$
approximation is up to 40% lower than the $b$ distribution at large $x$. At small $x$, on the
other hand, the ratio becomes larger than one, clearly suggesting the inadequacy of the
LO approximation to perform a meaningful resummation.



The NLO ratio, displayed in the middle and bottom plots of Fig. 1, clearly shows
that the explicit collinear logs present in a 4F calculation at NLO already provide a rather
accurate approximation of the whole resummed result at NLL, with significant effects of order
up to 20% appearing predominantly at large Bjorken $x$. At small $x$, the resummation at
NLO is well behaved and predicts very small differences between $\tilde{b}$ and $b$. It would be
interesting to explore and understand in more detail the small-$x$ evolution of the $b$-PDF
and in particular the origin of such a large difference between LO and NLO results. While
going somewhat beyond the scope of this paper, it is possible to qualitatively understand
why the LO approximation appears too crude in this region. At small $x$, i.e. at small
$N$, the evolution is dominated by the rightmost pole of the anomalous dimensions, which
is located at $N = 1$ because of the singularity at $x = 0$ in the Altarelli-Parisi splitting
functions. This singularity appears in $P_{qg}$ and $P_{qq}$ only at NLO (while it is present in
$P_{gq}$ and $P_{gg}$ already at leading order), and therefore its impact is missed by leading-order
evolution.
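The statement about the $N = 1$ pole can be checked directly on the leading-order splitting functions. The sketch below uses the standard LO expressions $P_{qg}(x) = T_R[x^2 + (1-x)^2]$ and $P_{gq}(x) = C_F[1 + (1-x)^2]/x$ and their Mellin moments $\int_0^1 dx\, x^{N-1} P(x)$, evaluated in closed form. The function names are illustrative, and the choice of these two kernels as representatives is made for this example only.

```python
CF, TR = 4.0 / 3.0, 0.5  # QCD colour factors


def moment_p_qg(n: float) -> float:
    """Mellin moment of LO P_qg(x) = TR [x^2 + (1 - x)^2]; finite at n = 1."""
    return TR * (n * n + n + 2.0) / (n * (n + 1.0) * (n + 2.0))


def moment_p_gq(n: float) -> float:
    """Mellin moment of LO P_gq(x) = CF [1 + (1 - x)^2] / x, valid for n > 1.

    The 1/x behaviour of the kernel at small x produces the pole at n = 1.
    """
    return CF * (2.0 / (n - 1.0) - 2.0 / n + 1.0 / (n + 1.0))


if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3):
        n = 1.0 + eps
        print(f"N = {n:.3f}   qg moment = {moment_p_qg(n):.4f}   "
              f"gq moment = {moment_p_gq(n):.2f}   "
              f"(N-1) * gq = {eps * moment_p_gq(n):.4f}")
```

As $N \to 1$ the $qg$ moment stays finite, while the $gq$ moment grows like $2C_F/(N-1)$, which is the leading-order small-$x$ enhancement that quark-initiated kernels only acquire at NLO.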

Our results are consistent with previous findings in the context of the $c$-PDF [49], even
though for the charm PDF the effect of the resummation is more important, as is also
clearly shown in Ref. [66]. They also provide a very simple explanation of the somewhat
counterintuitive fact that for $b$-initiated processes, such as single-top [67] and
$Hb\bar{b}$ [68] production, total cross sections calculated in the 5F scheme differ from the 4F ones
more at the Tevatron than at the LHC. Naively, one could expect either the size of the logs
to be independent of the collision centre-of-mass energy or, were this not the case (as we will
argue in the next section), more phase space to be available at higher energy to 'develop'
large logs, and therefore the effects of resummation to be more important the higher the collider
energy. The dependence of the resummation on the Bjorken $x$, and in particular the larger
effects present at higher $x$, point on the contrary to more important differences at smaller
collider energies, as is indeed observed.

4 Heavy quark production in lepton-hadron collisions 

The production of heavy quarks in lepton-hadron scattering is one of the first processes 
for which the choice of the heavy flavor schemes has been extensively analysed from both 
the theoretical and the experimental points of view. For a broad overview see Refs. [44, 
46, 49, 51, 69-75] and references therein, and the experimental analyses performed by the
H1 and ZEUS collaborations [76-82].

The relevant cross section can be expressed in terms of the structure functions usually 
defined in the context of deep-inelastic scattering. The contribution of heavy-quark final 
states to DIS structure functions has been historically computed in the massive scheme, 
where a heavy quark-antiquark pair is produced via vector boson-gluon fusion, as shown 
in the left diagrams of Fig. 2. 

Alternatively, one can adopt a variable flavor number scheme, whose leading-order
contribution is shown in the right diagram of Fig. 2. The main difference between the two
production mechanisms can be attributed to the fact that for massive heavy quark production
the quark and the anti-quark are produced in pairs, while in the second approach only
one heavy quark is produced at leading order. This difference is reflected in the transverse
momentum distribution of the bottom quark. At NLO, however, these striking differences
are milder as, for instance, the gluon splitting process appears also in the massless scheme.

Figure 1. Ratio $\tilde{b}/b$ for several values of $x$ as a function of the scale $\mu$. The 4F-FFNS and
GM-VFNS are associated to the $\tilde{b}$ and $b$ PDF computations respectively at LO (top) and NLO
(centre and bottom) for the MSTW08 [54] and NNPDF21 [52] parton sets.

Figure 2. Leading-order diagrams for heavy quark production in an $n$-flavor (left) and an
$(n+1)$-flavor (right) scheme.



4.1 Bottom Antibottom production 

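The total cross section for the process of heavy quark production in DIS can be written as

$$
\sigma_b(\mu^2) = \int_{Q^2_{\min}}^{Q^2_{\max}} dQ^2 \int_{y_{\min}}^{y_{\max}} dy \,
\frac{2\pi\alpha_l\alpha_b}{\ldots}
\Big\{ [1 + (1-y)^2]\, F_2^b(x,Q^2,m_b^2) - y^2 F_L^b(x,Q^2,m_b^2)
+ [1 - (1-y)^2]\, F_3^b(x,Q^2,m_b^2) \Big\} , \qquad (4.1)
$$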



where $\alpha_l$, $\alpha_b$ are the lepton and quark coupling constants to the exchanged vector boson,
$M$ its mass, $Q^2$ its virtuality, and

$$
y = \frac{Q^2}{xS} , \qquad (4.2)
$$

where $S$ is the centre-of-mass energy squared. The functions $F^b_{2,L,3}$ are the contributions
to the deep-inelastic scattering structure functions coming from bottom quark production. A
dependence of the total cross section on renormalization and factorization scales arises
because the calculation is performed at a finite order in the QCD perturbative expansion.
We start our analysis by comparing the scale dependence of the 4F and 5F calculations.
For simplicity, renormalization and factorization scales are taken to be equal. We choose

$$
\mu_F = \mu_R = k\sqrt{Q^2 + 4m_b^2} \qquad (4.3)
$$

and we let $k$ vary between 0.5 and 2. We have studied the scale dependence of both 4F
and 5F results for two different experimental configurations: HERA Run II and one of
the possible future LHeC scenarios. At HERA the beam energies are $E_p = 920$ GeV and
$E_e = 27.5$ GeV, and the kinematical cuts on the outgoing electron are set to

$Q^2 > 20$ GeV$^2$,   $0.05 < y < 0.7$,

as in Ref. [81]. In the LHeC scenario the energy of the proton beam has been set to $E_p = 7$
TeV and that of the electron beam to $E_e = 50$ GeV. The cuts in $y$ and $Q^2$ are set to

$Q^2 > 2000$ GeV$^2$,   $0.1 < y < 0.9$.

The results are shown in Fig. 3. The 4F scheme curves are obtained by running the code
HVQDIS [83], which is based on the calculation of Ref. [84]. The 5F curves have been
computed using the results presented in Ref. [51].

Figure 3. Comparison between 4F and 5F predictions for $b\bar{b}$ production at HERA Run II (top)
and LHeC (bottom). Input PDFs: NNPDF2.1_FFN_NF4 (LO and NLO) for the 4F scheme (run with
HVQDIS) and NNPDF2.1 (LO and NLO) for the 5F scheme (run with internal code). Parameters:
$m_b = 4.75$ GeV, $k = \mu/\sqrt{Q^2 + 4m_b^2}$, with $\mu = \mu_F = \mu_R$.
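The kinematic reach of the two configurations follows from the beam energies and cuts quoted above. The sketch below is an illustration only: it assumes head-on beams with negligible masses, so that $S = 4 E_p E_e$, the standard DIS relation $y = Q^2/(xS)$, and the scale choice $\mu = k\sqrt{Q^2 + 4m_b^2}$ with $m_b = 4.75$ GeV. The dictionary layout and function names are hypothetical.

```python
import math

M_B = 4.75  # GeV, bottom quark mass

# Beam energies (GeV) and cuts (Q2 in GeV^2) for the two configurations.
CONFIGS = {
    "HERA II": {"E_p": 920.0, "E_e": 27.5, "Q2_min": 20.0, "y_min": 0.05, "y_max": 0.7},
    "LHeC": {"E_p": 7000.0, "E_e": 50.0, "Q2_min": 2000.0, "y_min": 0.1, "y_max": 0.9},
}


def cm_energy_squared(e_p: float, e_e: float) -> float:
    """S = 4 E_p E_e for head-on beams, neglecting proton and electron masses."""
    return 4.0 * e_p * e_e


def bjorken_x(q2: float, y: float, s: float) -> float:
    """Momentum fraction obtained by inverting y = Q^2 / (x S)."""
    return q2 / (y * s)


def scale(q2: float, k: float = 1.0) -> float:
    """Common renormalization and factorization scale, k * sqrt(Q^2 + 4 m_b^2)."""
    return k * math.sqrt(q2 + 4.0 * M_B**2)


if __name__ == "__main__":
    for name, cfg in CONFIGS.items():
        s = cm_energy_squared(cfg["E_p"], cfg["E_e"])
        x_low = bjorken_x(cfg["Q2_min"], cfg["y_max"], s)
        band = [scale(cfg["Q2_min"], k) for k in (0.5, 1.0, 2.0)]
        print(f"{name}: sqrt(S) = {math.sqrt(s):.1f} GeV, "
              f"x at (Q2_min, y_max) = {x_low:.2e}, "
              f"scale band at Q2_min = {band[0]:.2f}-{band[2]:.2f} GeV")
```

At the lower $Q^2$ cut the central scale is already about twice $m_b$ at HERA, since the $4m_b^2$ term dominates there, whereas at the LHeC the scale is driven by $Q^2$ itself.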
In the HERA II configuration the flavor excitation (the LO $2 \to 2$ process) is considerably
larger than the other predictions, being completely driven by the $b$ quark PDF, which
does not provide any transverse momentum for the produced bottom quark. At NLO, the
Zero Mass prediction (where $m_b$ is set to zero everywhere except in the boundary conditions for
the evolution of the $b$ PDF) displays a sizeable scale dependence, even though it is milder
than in the pure flavor excitation. The matched prediction, computed according to the
FONLL-A scheme, where the collinear logarithms are resummed and mass effects are also
taken into account, shows a mild dependence on the scale and is close to the predictions
of the 4F scheme. This indicates that, in the low-$Q^2$ region covered in this experimental
configuration, finite-$m_b$ effects dominate over the effects due to the resummation of the
logarithms, which we expect to be very small.

In the LHeC configuration, where much larger $Q^2$ scales are probed, the 4F LO prediction
displays a scale dependence which is comparable to that
of the 5F LO prediction, the former being driven by the $\alpha_s$ scaling and the latter
by the bottom PDF evolution, which go in opposite directions. At NLO the ZM
and the FONLL predictions in the 5F scheme are very close to each other, thereby showing
that finite-$m_b$ effects are much smaller at such high scales. On the other hand,
as compared to the 4F NLO prediction, they display a milder scale dependence. This tells
us that in this kinematic region the resummation of the collinear logarithms dominates
over the finite-$m_b$ mass effects.



Figure 4. Distributions of events as a function of the momentum fraction carried by the bottom
quark in the LO 5F $eb \to eb$ process, in the HERA (left) and LHeC (right) configurations.
Input PDF: NNPDF21 (LO).



In Fig. 4 we plot the distribution of the momentum fraction $z$ carried by the bottom
quark in the 5F process. The distribution peaks at small $z$ for the HERA II kinematics,
while the peak shifts to larger values of $z$ in the LHeC case. Comparing to the plots in
Fig. 1, we conclude that the $\tilde b$ approximation indeed works much worse at the LHeC
than at HERA, not only because of the larger energy scales probed at the LHeC, but also
because the relevant values of the parton momentum fraction $z$ are comparatively larger
than in the case of HERA II.

We now turn to the problem of the scale choice for collinear emission. The explicit
expression of the LO partonic cross section for $b\bar b$ production in DIS is reported in
Appendix A. The partonic cross section $d\hat\sigma_2$, relevant for the structure function
$F_2$, displays logarithmic contributions both in the $t$- and $u$-channels, which become
large as $m_b \to 0$. In order to see this explicitly, and to compare the results in the 4F
and 5F schemes, it is useful to rewrite Eq. (A.16) as a Laurent expansion around
$t - m_b^2 = 0$ in order to isolate the collinear singularity, and further neglect terms
which are suppressed by powers of $m_b^2/Q^2$. We obtain
$$\frac{d\hat\sigma_2}{dt} \;\propto\; z\,\frac{z^2 + (1-z)^2}{Q^2\,(t - m_b^2)} + \text{non-singular terms}, \qquad (4.4)$$



where $z = Q^2/(s + Q^2)$. The rhs of Eq. (4.4) is immediately recognized to be proportional
to the Altarelli-Parisi splitting function $P_{qg}(z)$ as reported in Eq. (C.4). The
contribution of the collinear region to the cross section takes the form dictated by the
factorization theorem:



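$$\int_{t_-}^{t_+} dt\,\frac{d\hat\sigma_2}{dt} \;\propto\; \frac{z}{Q^2}\,P_{qg}(z)\,\log\frac{1+\beta}{1-\beta} = \frac{z}{Q^2}\,P_{qg}(z)\left[\log\frac{s}{m_b^2} + O\!\left(\frac{m_b^2}{s}\right)\right], \qquad (4.5)$$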



where we have used

$$t_\pm = m_b^2 - \frac{s + Q^2}{2}\,(1 \pm \beta), \qquad \beta = \sqrt{1 - \frac{4m_b^2}{s}}. \qquad (4.6)$$



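Thus, as expected, the contribution of the collinear region is proportional to

$$\frac{\alpha_s}{2\pi}\,P_{qg}(z)\,\log\frac{s}{m_b^2} = \frac{\alpha_s}{2\pi}\,P_{qg}(z)\,\log\left[\frac{Q^2}{m_b^2}\,\frac{1-z}{z}\right]. \qquad (4.7)$$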



In a leading-order 5F computation, only this logarithmically divergent contribution is taken
into account, and resummed to all orders in the evolution of the $b$ parton density. The
argument of the collinear logarithm in the 5F scheme is $\mu_F^2/m_b^2$, and the factorization
scale $\mu_F^2$ is chosen to be $Q^2$. We now note that in the full massive calculation,
Eq. (4.7), the scale is not simply proportional to $Q^2$, but rather to the invariant mass of
the produced $b\bar b$ pair:



$$L_{DIS} = \log\left[\frac{Q^2}{m_b^2}\,\frac{1-z}{z}\right] = \log\frac{M_{b\bar b}^2}{m_b^2}. \qquad (4.8)$$



The scale of the logarithm in Eq. (4.7) is a dynamical scale which changes on an
event-by-event basis depending on the momentum fraction carried by the gluon and on the
kinematic invariants $Q^2$ and $z$. (This is well known for example in the context of Sudakov
resummation [85], where $Q^2(1 - z)$ is identified as the characteristic energy scale of soft
emission.)
Therefore, to assess its size for a given collider energy and acceptance, one has to check
the distribution of $(1 - z)/z$. If this distribution is peaked in a region which is much
smaller than one, then the logarithms of $M_{b\bar b}^2/m_b^2$ are not in fact large, even
when $Q^2 \gg m_b^2$. If, on the other hand, it is close to or even larger than one,
logarithms should be resummed.
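
As a rough numerical illustration of this point, the sketch below compares the logarithm obtained with the scale fixed at $Q^2$ with the dynamical logarithm $L_{DIS} = \log[(Q^2/m_b^2)(1-z)/z]$. The function names and the sample kinematic points are illustrative assumptions, not taken from the analysis; only $m_b = 4.75$ GeV is the value quoted in the figure captions.

```python
import math

M_B = 4.75  # GeV; bottom-quark mass quoted in the figure captions


def naive_log(q2, mb=M_B):
    """Collinear logarithm with the scale fixed to Q^2: log(Q^2 / m_b^2)."""
    return math.log(q2 / mb**2)


def dis_log(q2, z, mb=M_B):
    """L_DIS = log[(Q^2 / m_b^2) * (1 - z) / z], with z = Q^2 / (s + Q^2)."""
    if not 0.0 < z < 1.0:
        raise ValueError("z must lie strictly between 0 and 1")
    return math.log(q2 / mb**2 * (1.0 - z) / z)


def pair_mass_sq(q2, z):
    """Invariant mass squared of the b-bbar pair, s = Q^2 (1 - z) / z."""
    return q2 * (1.0 - z) / z


# Hypothetical kinematic points (Q^2 in GeV^2, z), chosen only for illustration.
for q2, z in [(25.0, 0.1), (25.0, 0.5), (2000.0, 0.6)]:
    print(f"Q2={q2:7.1f}  z={z:.2f}  naive={naive_log(q2):6.3f}  L_DIS={dis_log(q2, z):6.3f}")
```

For $z < 1/2$ the factor $(1-z)/z$ exceeds one and the dynamical logarithm is larger than the naive one; for $z > 1/2$ it is smaller.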




Figure 5. Distributions of events as a function of $Q$ (left), $(1-z)/z$ (centre) and
$\log(s/m_b^2)$ (right) for HERA II (top) and LHeC (bottom). In the left and centre plots,
50% (80%) of events are to the left of the vertical solid (dot-dashed) line.
Input PDF: NNPDF21_FFN_NF4 (LO).



In Fig. 5 we plot the distributions of $Q$, $(1-z)/z$ and the collinear logarithm. From the
plot of the $Q$ distribution we see that the two experimental configurations we are
considering do explore two complementary regions: for HERA II the bulk of events lies in the
region $Q \sim m_b$, while at the LHeC $Q$ is typically much larger than $m_b$. The event
distribution in the kinematic factor $(1-z)/z$ is also rather different in the two cases. For
HERA II, 80% of events are in the region $(1 - z)/z < 10$. This means that the scale of the
logarithm is sizeably larger than $Q^2/m_b^2$ for a large fraction of events. Therefore, even
if the experimental cuts are such that $Q^2$ lies in a region where $\log(Q^2/m_b^2)$ is very
small, the effect of these logarithms is enhanced by a scale which is effectively up to ten
times larger than $Q^2$. As a result, in this case the logarithms are not as negligible as one
would expect from the typical size of $Q^2$.
On the other hand, in the LHeC kinematics the prefactor $(1-z)/z$ is peaked around a value
smaller than one. This means that in the LHeC configuration the scale associated to the
collinear splitting is slightly, but not dramatically, smaller than $Q^2$, and that therefore
the logarithms which are resummed in the bottom PDF are as significant as one would expect by
looking at the $Q^2$ distribution.

5 Hadron-hadron collisions

In this section we discuss $b$-initiated processes at hadron colliders. Such processes fall
into two wide categories, depending on whether they can be described by one or two $b$'s in
the initial state. Associated production of a $b$ plus a jet, a $W$ or $Z$ boson, a Higgs
boson, $t$-channel and associated $tW$ and $tH^\pm$ single top production are examples of the
first class, while $Zb\bar b$ and $Hb\bar b$ production are examples of the second. For the
sake of the argument, in this work we mainly focus on one-$b$ initiated processes. In fact,
as we will argue in the final discussion, two-$b$ initiated processes can be analysed
following the same lines, providing, for instance, a useful framework to decipher well-known
apparent discrepancies between 4F and 5F predictions for $pp \to Hb\bar b$ production [68].

We will consider specifically $W$ production initiated by a bottom quark, and single top
production. The large energy scale associated to these processes is the invariant mass of the
final state, rather than the high virtuality of the incoming particles, at variance with DIS.
As we will show in the following, this makes a difference in the phase space available for
the spectator $b$ quark or extra radiation in general. In addition, single top already
provides all the kinematical complexity needed to obtain a general expression for the
initial-state collinear logarithm, a form that can then be used for any process of this type,
including those involving two $b$'s in the initial state.

We first analyse the associated production of a heavy quark with a gauge boson or a
charged scalar. In the 5F scheme these processes start with the simplest possible topology,
i.e., $2 \to 1$ at Born level, and feature a $b$ quark as one of the initial-state partons.
While these processes are not of prime phenomenological importance in themselves, 4F and 5F
scheme predictions can be easily compared at NLO, and provide a very simple baseline for
studying more complex final states.

We then turn to the analysis of $t$-channel single top production. This probes directly
the charged-current interaction of the top quark and it is therefore sensitive to possible
new physics associated to the charged-current weak interaction of the top quark. A full
calculation of the NLO corrections to $W$-gluon fusion has already been presented in the
literature [86-88], and several schemes have been used. The calculation was originally
performed [89, 90] in the 5F scheme. More recently [6, 67] the 4F scheme was adopted in
an NLO calculation where a first comparison between 4F and 5F results was performed.

In the last subsection we generalise the results obtained by analysing $t$-channel
single-top production and present formulae that can be applied to any $b$-initiated process,
with either one or two $b$'s in the initial state.

5.1 Associated W and bottom production 

Associated production of a $W$ with heavy quarks, such as charm and bottom, is of great
phenomenological importance at the Tevatron as well as the LHC. The production of a $W$
boson and a charm quark, for example, can provide very useful information on the strange
content of the proton; it has been the subject of measurements at the Tevatron [91] and it is
being explored now at the LHC [92]. $Wb$ associated production is also of phenomenological
interest, yet its main production mechanism at hadron colliders does not involve a $b$ in the
initial state, since at leading order it amounts to $W$ Drell-Yan production with an extra
gluon splitting into a $b\bar b$ pair.

In this subsection, we consider the simple $2 \to 1$ processes $u\bar b \to W^+$ and
$\bar u b \to W^-$. This is to be considered as a simple alternative, yet fully analogous, to
the sample process $b\bar t \to H^-$ considered in the seminal work by Olness and Tung [93].
The main difference here is that we focus on the $b$ quark and consider a valence-like PDF in
the other initial-state leg, ignoring the suppression originated by the small CKM coupling.
In this way, our analysis is not affected by the presence of a sea quark in the initial state.

We first compare the size and the scale dependence of the 4F and 5F predictions at
leading and next-to-leading order. Our results are shown in Fig. 6. The 4F curves are
obtained using the MCFM code [94], while we have computed the 5F cross sections. We
have used the NNPDF2.1 family of parton distribution functions. In particular the default
GM-VFN scheme-based set was used for the 5F calculation, and the 4-FFN scheme set
was used for the 4F computations. We have considered both the case of the production
of a real $W$ boson (first plot in Fig. 6), and that of a virtual one, for two different values
of the invariant mass of its decay products, namely 400 GeV (second plot) and 800 GeV
(third plot). We take the renormalization and factorization scales to be equal to $k M_W$,
with $0.1 < k < 3$.

We observe that, for large values of the invariant mass of the $W$ decay products, the
5F cross section is typically larger than the 4F prediction at the central value $k = 1$. The
difference decreases with decreasing $k$, and the crossing point is around $k \sim 0.1$.

These features can be qualitatively understood by studying the explicit expressions of
the leading-order cross sections in the two schemes. At leading order, the relevant partonic
subprocesses for the 5F calculation are

$$b(k_1) + u(k_2) \to W(k). \qquad (5.1)$$

The corresponding cross section is

$$\sigma^{5F}(\tau) = \frac{\pi\sqrt{2}}{3}\,G_F\,\tau\,\mathcal{L}_{ub}(\tau,\mu_F^2), \qquad (5.2)$$

where $\tau = M_W^2/S$, $S$ is the hadronic centre-of-mass energy squared, and we have set
$V_{ub} = 1$. The parton luminosity is given by

$$\mathcal{L}_{ub}(\tau,\mu_F^2) = 2\int_\tau^1 \frac{dz}{z}\,u^+\!\left(\frac{\tau}{z},\mu_F^2\right) b(z,\mu_F^2), \qquad (5.3)$$

where $u^+ = u + \bar u$. As explained in Section 3, the heavy quark parton distribution
function $b(z,\mu_F^2)$ arises dynamically as a consequence of Altarelli-Parisi evolution,
which resums the leading powers of $\log(\mu_F^2/m_b^2)$ to all orders in perturbation theory.
If only the first order of this resummation is retained, one obtains an approximate form for
the 5F cross section:



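$$\sigma^{5F}(\tau) = \frac{\pi\sqrt{2}}{3}\,G_F\,\tau\,\frac{\alpha_s}{2\pi}\int_\tau^1 \frac{dz}{z}\,\mathcal{L}_{ug}\!\left(\frac{\tau}{z},\mu_F^2\right) P_{qg}(z)\,\log\frac{\mu_F^2}{m_b^2} + O\!\left(\alpha_s^2\log^2\frac{\mu_F^2}{m_b^2}\right). \qquad (5.4)$$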



The choice of factorization scale will be discussed later in this Section. Here we recall from
Section 3 that, for fixed $\mu_F$, the neglected terms in Eq. (5.4) are smaller, the smaller
the momentum fraction carried by the $b$ quark. It is therefore interesting to study the
distribution in this variable, for different values of the virtuality of the produced $W$
boson. This distribution is displayed in Fig. 7. We note that the dominant contribution to
the convolution integral in Eq. (5.3) comes from very small values of the momentum fraction,
which increase with increasing $M_W$. On the basis of the discussion in Section 3, we conclude
that the impact of resummation from Altarelli-Parisi evolution is in this case rather small,
except for very high virtuality of the $W$.

Figure 6. Comparison between 4F and 5F predictions for $b$-initiated $W$ production at LHC 14
TeV, as a function of $k = \mu/M_W$, with $\mu = \mu_F = \mu_R$, for $M_W = 80.4$ GeV (top),
$M_W = 400$ GeV (middle), $M_W = 800$ GeV (bottom) and $m_b = 4.75$ GeV. Input PDFs:
NNPDF21_FFN_NF4 and NNPDF21 for 4F and 5F respectively.

Figure 7. Distribution of the fraction of momentum carried by the $b$ quark (antiquark)
in $W$ production initiated by bottom at LHC 14 TeV, LO 5F scheme, for $M_W = 80.4$ GeV (left),
$M_W = 400$ GeV (centre), $M_W = 800$ GeV (right). Input PDFs: NNPDF2.1 (LO).

The factorization scale in the 5F calculation is typically chosen to be the hard scale
characteristic of the process, in our case $M_W$. This is nothing but an order-of-magnitude
indication. A less vague indication is given by the comparison with the 4F computation,
which we now illustrate. In analogy with the previous case, we consider the subprocesses

$$g(p_1) + u(p_2) \to b(p_3) + W(p_4). \qquad (5.5)$$

The total cross section is

$$\sigma^{4F}(\tau) = \tau\int_\tau^1 \frac{dz}{z^2}\,\mathcal{L}_{ug}\!\left(\frac{\tau}{z},\mu_F^2\right)\hat\sigma^{4F}(z), \qquad (5.6)$$

where $z = M_W^2/s$, $s = (p_1 + p_2)^2$ and

$$\hat\sigma^{4F}(z) = \int_{t_-}^{t_+} dt\,\frac{d\hat\sigma}{dt}(s,t,\alpha_s). \qquad (5.7)$$

The variable $t = (p_1 - p_3)^2$ represents the collinearity of the bottom pair produced by
gluon splitting, and the kinematical bounds are given in Eq. (A.27). The $\mathcal{L}_{ug}$
parton luminosity is defined as in Eq. (5.3).

The partonic differential cross section is given in Eq. (A.25). In order to compare the
results in the 4F and 5F schemes, it is useful to rewrite Eq. (A.25) as a Laurent
expansion around $t - m_b^2 = 0$ in order to isolate the collinear singularity, and further
neglect terms which are suppressed by powers of $m_b^2/M_W^2$. We find

$$\frac{d\hat\sigma}{dt} = \frac{\alpha_s G_F}{6\sqrt{2}\,s^3}\,\frac{-2M_W^6 + 2M_W^4 s - M_W^2 s^2}{t - m_b^2} + \text{non-singular terms}. \qquad (5.8)$$

Since, in this limit,

$$\int_{t_-}^{t_+} \frac{dt}{t - m_b^2} = -\log\frac{(s - M_W^2)^2}{m_b^2\,s}, \qquad (5.9)$$

the total partonic cross section is

$$\hat\sigma^{4F}(z) = \frac{\alpha_s G_F}{6\sqrt{2}\,s^3}\left(2M_W^6 - 2M_W^4 s + M_W^2 s^2\right)\log\frac{(s - M_W^2)^2}{m_b^2\,s} + O(m_b^0), \qquad (5.10)$$

where $O(m_b^0)$ stands for constant or vanishing terms in the limit $m_b \to 0$. Recalling
Eq. (5.6), we can rewrite it as

$$\sigma^{4F}(\tau) = \frac{\pi\sqrt{2}}{3}\,G_F\,\tau\,\frac{\alpha_s}{2\pi}\int_\tau^1 \frac{dz}{z}\,\mathcal{L}_{ug}\!\left(\frac{\tau}{z},\mu_F^2\right) P_{qg}(z)\,L_{DY}, \qquad (5.11)$$

with

$$L_{DY} = \log\left[\frac{M_W^2}{m_b^2}\,\frac{(1-z)^2}{z}\right]. \qquad (5.12)$$


In order to assess the impact of neglecting non-singular terms and the $b$ mass, we have
computed the total cross section both with the exact and with the approximated partonic cross
section. The results are 43.7 nb and 37.6 nb, respectively. In Fig. 8 the density plot of the
cross section is displayed as a function of $1 - z$ and $(t_+ - t)/(t_+ - t_-)$. It shows
that indeed most of the cross section lies in the small-$t$ region, while the points are
uniformly distributed in $1 - z$. In particular they are not concentrated in the threshold
region $z \sim 1$, clearly showing that collinear splitting $g \to b\bar b$ is not a soft
effect.

It is now easy to compare Eq. (5.11) with the 5F computation in the approximation
of Eq. (5.4). Equations (5.4) and (5.11) have the same structure; they only differ in
the argument of the collinear logarithm, which has a fixed value in the case of the 5F
calculation, while it depends on the momentum fraction $z$ in the 4F result. In order to
estimate the size of the dynamical scale, we have computed the distribution of $(1 - z)^2/z$,
which is the suppression factor in the argument of the collinear logarithm with respect to
$M_W^2/m_b^2$. The result is shown in Fig. 9. We see that this distribution is indeed peaked
around values which are sizably smaller than one, and that the peak is shifted to smaller
values with increasing $M_W$.
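
The behaviour of the suppression factor can be explored with a few lines of code. The sketch below evaluates $(1-z)^2/z$ and the corresponding logarithm $L_{DY}$ of Eq. (5.12); the function names are illustrative assumptions, and the value $m_b = 4.75$ GeV is the one quoted in the figure captions.

```python
import math

M_B = 4.75  # GeV; bottom-quark mass quoted in the figure captions


def dy_scale_factor(z):
    """Suppression factor (1 - z)^2 / z, with z = M_W^2 / s."""
    if not 0.0 < z < 1.0:
        raise ValueError("z must lie strictly between 0 and 1")
    return (1.0 - z) ** 2 / z


def dy_log(m_w, z, mb=M_B):
    """L_DY = log[(M_W^2 / m_b^2) * (1 - z)^2 / z]."""
    return math.log(m_w**2 / mb**2 * dy_scale_factor(z))


def z_unit_factor():
    """Value of z at which (1 - z)^2 / z = 1, i.e. the root of z^2 - 3z + 1 = 0 in (0, 1)."""
    return (3.0 - math.sqrt(5.0)) / 2.0


z0 = z_unit_factor()
print(f"factor equals one at z = {z0:.4f}; it is below one for larger z")
print(f"L_DY(M_W = 80.4 GeV, z = 0.5) = {dy_log(80.4, 0.5):.3f}")
```

The factor is smaller than one whenever $z$ exceeds the root returned by `z_unit_factor`, so the dynamical logarithm falls below $\log(M_W^2/m_b^2)$ in that region.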

As a confirmation of the conclusion arising from inspection of Fig. 9, we observe that
the two calculations are seen to give the same result for $\mu_F = \tilde\mu_F$, such that

$$\sigma^{5F}(\tau)\Big|_{\mu_F = \tilde\mu_F} = \sigma^{4F}(\tau). \qquad (5.13)$$

For $\sqrt{S} = 14$ TeV and $M_W = 80.4$ GeV we find

$$\tilde\mu_F = 33~\mathrm{GeV} \quad (\tilde\mu_F = 42~\mathrm{GeV}), \qquad (5.14)$$

where the result in parentheses is obtained by using the exact partonic cross section for the
4F calculation, i.e., keeping the exact $m_b$ and $t$ dependence. Both results point to a value
remarkably smaller than $M_W$, i.e. in the range $\tilde\mu_F \simeq [0.4, 0.5]\,M_W$. This
reduction is even more pronounced at larger values of the $W$ virtuality:

$$M_W = 400~\mathrm{GeV}: \quad \tilde\mu_F \simeq [0.3, 0.4]\,M_W, \qquad M_W = 800~\mathrm{GeV}: \quad \tilde\mu_F \simeq [0.25, 0.35]\,M_W. \qquad (5.15)$$

Figure 8. Density plots of the hadronic cross section of the leading-order 4F $Wb$ production at
LHC 14 TeV as a function of $1 - z$ and $(t_+ - t)/(t_+ - t_-)$. The straight lines represent
the median of the distributions in $x$ and $y$, respectively.

Figure 9. $b$-initiated $W$ production at the LHC 14 TeV. $M_W = 80.4$ GeV (left), $M_W = 400$
GeV (centre) and $M_W = 800$ GeV (right). Upper plots: Distribution of events (in pb/bin) as a
function of $\mathcal{Q}^2_{DY}(z)/M_W^2 = (1 - z)^2/z$. 50% of the events lie on the left of
the vertical solid line, 80% of the events on the left of the vertical dot-dashed line. Lower
plots: Distribution of events (in pb/bin) as a function of $\log(\sqrt{|t|}/M_W)$.

The above results clearly suggest that "fair" comparisons between 4F and 5F calculations
should be performed at factorization scales that are in general smaller than a naive choice.
This fact had been noticed previously in several studies concerning $b$-initiated processes,
see for instance [30, 37, 68, 95, 96].
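
To make the idea of an effective factorization scale concrete, here is a minimal sketch that equates the fixed 5F logarithm with a weighted average of the dynamical logarithm $L_{DY}$ over $z$. The weight uses a toy power-law luminosity, which is purely a hypothetical stand-in for the real $\mathcal{L}_{ug}$ (no PDF set is used), so the output is only indicative of the mechanism and is not a reproduction of the numbers quoted above.

```python
import math

M_B = 4.75  # GeV; bottom-quark mass quoted in the figure captions


def p_qg(z):
    """Leading-order g -> q qbar splitting function, T_R [z^2 + (1 - z)^2] with T_R = 1/2."""
    return 0.5 * (z * z + (1.0 - z) ** 2)


def toy_lumi(x, a=1.3, b=5.0):
    """Hypothetical luminosity shape x^(-a) (1 - x)^b; an assumption, not a fit to any PDF set."""
    return x ** (-a) * (1.0 - x) ** b


def effective_scale(m_w, sqrt_s, n=20000):
    """Scale mu such that log(mu^2 / m_b^2) equals the weighted mean of L_DY over z."""
    tau = (m_w / sqrt_s) ** 2
    dz = (1.0 - tau) / n
    num = 0.0
    den = 0.0
    for i in range(n):
        z = tau + (i + 0.5) * dz  # midpoint rule: never evaluates z = tau or z = 1
        weight = toy_lumi(tau / z) * p_qg(z) / z
        l_dy = math.log(m_w**2 / M_B**2 * (1.0 - z) ** 2 / z)
        num += weight * l_dy
        den += weight
    return M_B * math.exp(0.5 * num / den)


ratio = effective_scale(80.4, 14000.0) / 80.4
print(f"toy estimate of mu_tilde / M_W = {ratio:.2f}")
```

With this toy weight the effective scale is expected to come out below $M_W$, in qualitative agreement with the discussion above; a realistic estimate would require the actual parton luminosities.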

Finally, it is instructive to plot the differential cross section as a function of
$\log(\sqrt{|t|}/M_W)$ (lower plots of Fig. 9), i.e., $t\,d\sigma/dt$. For a massless quark one
expects a plateau to develop at small $t$, corresponding to the collinear pole
$d\sigma/dt \propto 1/t$. For a finite $b$-quark mass the collinear divergence is regulated by
the quark mass and the distribution goes to zero (the so-called dead cone). The drop of the
plateau at larger $t$ can be taken as signaling the end of the collinear-enhanced region,
i.e., where the $2 \to 2$ hard scattering dominates. We observe that in general the collinear
plateau drops at scales smaller than $M_W$ for the production of on-shell $W$ bosons, in line
with the results found above for the factorization scale. The scale associated to the
splitting of the gluon is softer than the scale associated to the hard $W$ production process,
which is naturally given by $M_W$. This implies that the factorization scale that should be
set for the bottom PDFs in the five-flavor scheme is smaller than $M_W$, as suggested in
several earlier studies. The considerations in this subsection help to understand the
dynamical origin of this fact.

5.2 Single top production 

We now consider the production cross section for events with a single top quark in the final
state in hadron-hadron collisions. It was shown in Refs. [6, 67] that the central values of
the cross section predicted at next-to-leading order according to the 4F and 5F schemes
differ by 5% or less, both at the Tevatron and at the LHC, when the factorization scale is set
around the mass of the top quark (or the invariant mass of its decay products). At
the Tevatron, the difference is well within the combined uncertainty from higher orders and
PDFs, while at the LHC (10 TeV) the consistency was found to be marginal. For larger
masses, i.e., for $t'$ production, the differences were found to be much larger. For a $t'$ of
mass 1 TeV, the $2 \to 2$ prediction is almost twice as large at the Tevatron and 20% larger
at the LHC. Therefore, for such large top masses it could well be that the logarithm that
is implicitly resummed in the bottom quark distribution function becomes relevant,
or that an even smaller factorization scale should be used. Here we further investigate this
process by using the same procedure adopted in the previous section.
Figure 10. Comparison between 4F and 5F predictions for single top at LHC 14 TeV as a function
of $k = \mu/M_t$, with $\mu = \mu_F = \mu_R$, for $M_t = 172.5$ GeV (top), $M_t = 400$ GeV
(middle), $M_t = 800$ GeV (bottom) and $m_b = 4.75$ GeV. Input PDFs: NNPDF21_FFN_NF4 and
NNPDF21 (LO and NLO) for 4F and 5F respectively.
In Fig. 10 the scale dependence of the total cross section for single top production in
the $t$-channel is displayed for both the leading and the next-to-leading order computations.
For this process the scales are varied about the mass of the top. In addition to the
on-shell top quark case $M_t = 172.5$ GeV, we also consider two cases where the top has
a higher virtuality: $M_t = 400$ GeV and 800 GeV. All curves have been produced by
running the MCFM code. In particular, the ACOT computation for the 5F scheme is
obtained by matching the LO massive computation with the NLO massless one. The input
PDFs belong to the NNPDF2.1 family [52]. The set associated to the 5F scheme is the
default NNPDF2.1 NLO set, based on a FONLL-A scheme which at NLO is equivalent to
the ACOT scheme [51]. The set associated to the 4F scheme is the NNPDF2.1_FFN_NF4 set,
which is based on a 4F fixed flavor number scheme.

In both figures we observe that the LO curves have opposite behavior, the
5F curve being driven by the scale dependence of the bottom PDFs at the relevant $z$ where the
fraction of the momentum is peaked, while the 4F one is driven by the running of $\alpha_s$. We
further observe that both the leading-order and the next-to-leading-order curves obtained in
the two schemes are closer to each other for a value of the factorization and renormalization
scales which is about half of the mass of the top quark. For a heavier top, the scale
dependence of the 4F computation deteriorates and the scale at which the two computations get
closer decreases with increasing top-quark mass.

We first consider the leading-order computation in the 5F scheme. The relevant partonic
subprocess is

$$W^*(q) + b(p) \to t(k), \qquad (5.16)$$

and the corresponding amplitude is easily computed:

$$\mathcal{M}^\mu = \frac{g_W}{\sqrt{2}}\,\bar u(k)\,\gamma^\mu P_L\,u(p). \qquad (5.17)$$

As explained in Appendix A, the contribution to the structure function $F_2$ is obtained by
applying a suitable projection operator on the squared amplitude, and integrating over the
one-particle phase space. The result is

$$\hat\sigma_2^{5F}(z) = \pi g_W^2\,\delta(1 - z), \qquad (5.18)$$

where we have defined, in analogy with deep-inelastic scattering,

$$z = \frac{M_t^2 + Q^2}{s + Q^2}; \qquad s = (p + q)^2; \qquad Q^2 = -q^2, \qquad (5.19)$$

so that $0 < z < 1$, and we have used

$$d\phi_1(p + q; k) = \frac{d^3k}{(2\pi)^3\,2k^0}\,(2\pi)^4\,\delta^{(4)}(p + q - k) = \frac{2\pi}{s + Q^2}\,\delta(1 - z). \qquad (5.20)$$
Thus, at leading order the cross section (or better, the contribution to $F_2$) from $t$
production is simply proportional to the $b$ parton distribution function. From Fig. 11 we
conclude that the fraction of momentum carried by the $b$ quark is peaked at larger values as
the virtuality of the produced top increases.

Figure 11. Distributions of events as a function of the proton momentum fraction carried by the
incoming $b$ quark in single top production at LHC in the 5F scheme, leading order.
$\sqrt{S} = 14$ TeV and $M_t = 172.5$ GeV (left), $M_t = 400$ GeV (centre) and $M_t = 800$ GeV
(right). Input PDF: NNPDF21 (LO).



In order to interpret correctly the scale dependence of the curves in Fig. 10 we consider
the collinear limit of the massive leading-order computation. The full expression of the LO
partonic differential cross section $d\hat\sigma_2/dt$ is given by Eqs. (A.8, A.9, A.10) with
$M = M_t$, $m = m_b$, $g_R = 0$ and $g_L = g_W/\sqrt{2}$. This specific term in the total cross
section is the only one which has a pole at $t = 0$ in the small-$m_b$ limit. Taking the limit
$m_b \to 0$ and keeping only terms which give rise to the collinear singularity we find



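$$\frac{d\hat\sigma_2}{dt} = \frac{3\alpha_s g_W^2 C_F}{64\,(s + Q^2)^3}\,\frac{(M_t^2 + Q^2)^2 - M_t^2(s + Q^2) - Q^2(s + Q^2) + (s + Q^2)^2/2}{t - m_b^2} + \text{non-singular terms}. \qquad (5.21)$$

Hence, in this limit

$$\int_{t_-}^{t_+} dt\,\frac{d\hat\sigma_2}{dt} = \frac{3\alpha_s g_W^2 C_F}{128\,(s + Q^2)^3}\left[2M_t^4 + 4Q^2 M_t^2 - 2(s + Q^2)M_t^2 + s^2 + Q^4\right]\log\frac{(s - M_t^2)^2}{m_b^2\,s}. \qquad (5.22)$$

With the definition Eq. (5.19), it is immediate to recognize that Eq. (5.22) can be written
as

$$\int_{t_-}^{t_+} dt\,\frac{d\hat\sigma_2}{dt} = \frac{3\alpha_s g_W^2 C_F}{64\,(s + Q^2)}\,\frac{z^2 + (1 - z)^2}{2}\,\log\frac{\mathcal{Q}^2(z)}{m_b^2}, \qquad (5.23)$$

with

$$\mathcal{Q}^2(z) = \frac{(M_t^2 + Q^2)^2}{M_t^2 + (1 - z)\,Q^2}\,\frac{(1 - z)^2}{z}. \qquad (5.24)$$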



This result can be interpreted as the scale corresponding to the collinear splitting not being
$M_t$ or $\sqrt{M_t^2 + Q^2}$, but rather the dynamical scale $\mathcal{Q}(z)$. As we will
comment at length in the next subsection, the expression above actually encompasses all cases
discussed so far and paves the way towards the generalisation of our analysis to any process
involving one or two $b$ quarks in the initial state. That the collinear logs in single-top
production are in fact not very large can also be seen from the plots in Fig. 12, where cross
section distributions are shown as a function of $Q$, $\mathcal{Q}^2(z)/M_t^2$, and
$\log(\sqrt{|t|}/M_t)$, in the first, second and third rows respectively.

Figure 12. Single-top production at the LHC 14 TeV. Input PDF: NNPDF21_FFN_NF4 (LO). Top:
Distributions of events as a function of $\sqrt{Q^2}$; the vertical blue line corresponds to
$Q = M_t$, for $M_t = 172.5$ GeV (left), $M_t = 400$ GeV (centre) and $M_t = 800$ GeV (right).
Middle: Distributions of events as a function of $\mathcal{Q}^2(z)/M_t^2$; the vertical solid
line corresponds to the median of the distribution and the dot-dashed line corresponds to 80%
of the distribution, for $M_t = 172.5$ GeV (left), $M_t = 400$ GeV (centre) and $M_t = 800$
GeV (right). Bottom: Distributions of events as a function of $\log(\sqrt{|t|}/M_t)$ for
$M_t = 172.5$ GeV (left), $M_t = 400$ GeV (centre), $M_t = 800$ GeV (right).

5.3 The general case 

Single-top production in hadron collisions is not only interesting per se, but also because
it provides the simplest yet most general kinematics for any process involving one gluon
splitting in the initial state:

$$I(q) + g(p) \to b(k) + g(k_1) + \ldots + g(k_n) + X(P), \qquad (5.25)$$

Figure 13. Generic process involving gluon splitting into a $b\bar b$ pair in the initial state
and the production of a heavy system P. Multi-gluon emission from the quark line gives rise to
the collinear logarithms at all orders that are resummed into the $b$ parton distribution
function.

where $I$ is a generic initial-state particle (whose virtuality can also be space-like, as in
the case of DIS) and $X$ is a generic heavy system, possibly made out of several particles,
such that $P^2 \geq M^2$. The case of single-top production is obtained by identifying $I$
with a virtual $W$ boson with virtuality $q^2 = -Q^2$, and $X$ with a top quark, so that
$P^2 \geq M_t^2$. In the case of $Wb$ production, $I$ is a light parton ($q^2 = 0$) and $X$ is
a $W$, so that $P^2 \geq M_W^2$. In both cases, no light parton is present in the final state
in the lowest-order calculation of the 4F scheme; we have included in Eq. (5.25) the
possibility that the system $X$ is produced in association with a collection of light partons
with momenta $k_1, \ldots, k_n$, which is obviously the case for higher-order contributions to
the cross section. Multiple gluon emission from the $b$ quark line gives rise to the collinear
logs that are resummed in the $b$-PDFs, Fig. 13.
In the previous sections we have reconsidered the limit $m_b \to 0$ of the lowest-order
4F cross sections, which have a collinear singularity due to the gluon splitting into a
$b\bar b$ pair in the initial state; in other words, the $b$ quark mass acts as a regulator of
the collinear singularity. Naively, one therefore expects a logarithm of the type

$$L_{\rm naive} = \log\frac{M^2 + Q^2}{m_b^2}, \qquad (5.26)$$

i.e., of the ratio of the hard scale to the collinear regulator, to develop in the integrated
cross section. Large $M$ and/or $Q$ therefore motivate the resummation of such logs via the
introduction of $b$-PDFs and the use of the DGLAP equations.

In fact, our analysis based on the lowest-order processes has shown that the
logarithmically-enhanced contributions to the cross sections are proportional to

$$L = \log\frac{\mathcal{Q}^2(z)}{m_b^2}, \qquad (5.27)$$

where $\mathcal{Q}^2(z)$, defined in Eq. (5.24), can be rewritten in the following revelatory
form

$$\mathcal{Q}^2(z) = (M^2 + Q^2)\,\frac{(1 - z)^2}{z\left(1 - z\,\frac{Q^2}{M^2 + Q^2}\right)} \quad\text{with}\quad z = \frac{M^2 + Q^2}{s + Q^2}, \qquad (5.28)$$

which encompasses in one expression all interesting cases previously discussed.
Equation (5.27) reduces to the log in DIS $b\bar b$ production in the limit $M \to 0$:

$$L_{DIS} = \log\left[\frac{Q^2}{m_b^2}\,\frac{1 - z}{z}\right] \quad\text{with}\quad z = \frac{Q^2}{s + Q^2}, \qquad (5.29)$$

and to the log in the hadronic $Wb$ associated production in the limit $Q^2 \to 0$:

$$L_{DY} = \log\left[\frac{M^2}{m_b^2}\,\frac{(1 - z)^2}{z}\right] \quad\text{with}\quad z = \frac{M^2}{s}. \qquad (5.30)$$

In the case of single top production, both $M^2$ and $Q^2$ are in general non-negligible, and
the collinear log keeps its general form, Eqs. (5.27)-(5.28).
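
The two limits can be checked numerically. The sketch below assumes that the dynamical scale of Eq. (5.24) reads $\mathcal{Q}^2(z) = (M^2+Q^2)^2 (1-z)^2 / \{[M^2 + (1-z)Q^2]\,z\}$ with $z = (M^2+Q^2)/(s+Q^2)$; the function names and sample values are illustrative assumptions. It also verifies that, under this assumption, the scale equals $(s - M^2)^2/s$.

```python
import math


def z_of(s, m2, q2):
    """z = (M^2 + Q^2) / (s + Q^2)."""
    return (m2 + q2) / (s + q2)


def cal_q2(z, m2, q2):
    """Assumed form of the dynamical scale: (M^2+Q^2)^2 (1-z)^2 / ([M^2 + (1-z) Q^2] z)."""
    return (m2 + q2) ** 2 * (1.0 - z) ** 2 / ((m2 + (1.0 - z) * q2) * z)


# Hypothetical invariants in GeV^2, chosen only to exercise the formula.
s, m2, q2 = 1000.0, 100.0, 50.0
z = z_of(s, m2, q2)

# General case: the scale coincides with (s - M^2)^2 / s.
print(cal_q2(z, m2, q2), (s - m2) ** 2 / s)

# DIS limit (M -> 0): Q^2 (1 - z) / z.
z_dis = z_of(s, 0.0, q2)
print(cal_q2(z_dis, 0.0, q2), q2 * (1.0 - z_dis) / z_dis)

# Drell-Yan-like limit (Q^2 -> 0): M^2 (1 - z)^2 / z.
z_dy = z_of(s, m2, 0.0)
print(cal_q2(z_dy, m2, 0.0), m2 * (1.0 - z_dy) ** 2 / z_dy)
```

Each printed pair should agree up to rounding, which makes explicit how a single expression covers the DIS, $Wb$ and single-top cases.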

In this section we argue that the form of Eqs. (5.27)-(5.28) is not only general enough to
encompass all interesting processes at the lowest order, but is also valid at higher orders,
where powers of collinear logarithms are generated by multi-parton emission,

$$\log^n\frac{\mathcal{Q}_n^2(z)}{m_b^2}, \qquad (5.31)$$

with $\mathcal{Q}_n(z) \leq \mathcal{Q}(z)$, and for processes with two initial-state gluons
splitting into $b\bar b$ pairs.

A complete argument based on the ladder kinematics and valid at any order is detailed
in Appendix B. Here we provide an argument which, while it does not give the exact expression
of the cross section in the collinear limit, illustrates in a very simple way the origin and
universality of the kinematic terms appearing in Eq. (5.28).
For the sake of the argument, we consider the case where $X$ is a real heavy particle, such
as a top quark or a $W$ boson.

In the initial-state centre-of-mass system, the four-momenta of the particles in the 
process Eq. (5.25) can be parametrized as 



P 



s + Q\,^,s + Q^ 



2^s 



0,0, 



2^ 

S + Q2 



2yi ' ' ' 2^s 

kn = {oJn, 0, UJn sin 9, UJn COS 6) 
Pn = 



uj\ -I- M2,0, -Wnsin( 



-OJn COS ( 



(5.32) 

(5.33) 

(5.34) 
(5.35) 
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where Pn = P + k + ki + ... + K-i, P^ = M^ and 



t2 



since 

M^>M^. (5.37) 



We may now define 

M^ + Q^ 



(5.38) 



S + Q2 

so that the upper bound on the energy of the gluon with momentum kn can be written 

a;„ < -^ (5.39) 

where Q{z) is given in Eq. (5.28). As a consequence, the transverse momentum of each of 
the n gluons in the final state is bound from above by the same quantity. We therefore 
expect at most a factor of 

log^ (5.40) 

for each emitted gluon. This covers all processes that involve one gluon splitting in the initial state. In fact, given that no use of the details of the kinematics is made, the same argument can be equally applied to processes such as $gg \to Zb\bar b$ or $gg \to Hb\bar b$ that involve two gluons splitting into $b\bar b$ pairs in the initial state. A detailed proof in terms of $t$-channel kinematics is provided for this case too in Appendix B.
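The bound on the gluon energy can be checked numerically. The following is a minimal Python sketch, not part of the original derivation: it assumes that the effective scale reduces to $\mathcal{Q}^2(z) = (s-M^2)^2/s$ (the combination that appears in Eq. (A.28)), and the function names and numerical inputs are illustrative choices.

```python
import math

def gluon_energy(s, mn2):
    """Energy of a massless gluon recoiling against a system of invariant
    mass squared mn2 in the centre-of-mass frame, obtained from
    sqrt(s) = w + sqrt(w^2 + mn2)."""
    return (s - mn2) / (2.0 * math.sqrt(s))

def q2_eff(s, m2):
    """Assumed effective scale Q^2(z) = (s - M^2)^2 / s (see lead-in)."""
    return (s - m2) ** 2 / s

s = 500.0 ** 2    # illustrative partonic energy squared, in GeV^2
M2 = 173.0 ** 2   # illustrative heavy-particle mass squared, in GeV^2

bound = q2_eff(s, M2) / 4.0
for frac in (0.0, 0.25, 0.5, 0.75, 1.0):
    mn2 = M2 + frac * (s - M2)   # any allowed M_n^2 between M^2 and s
    w = gluon_energy(s, mn2)
    # energy conservation in the centre-of-mass frame
    assert abs(w + math.sqrt(w * w + mn2) - math.sqrt(s)) < 1e-6
    # the gluon energy squared never exceeds Q^2(z)/4
    assert w * w <= bound + 1e-6
```

The loop scans the invariant mass of the recoiling system between its minimum, $M^2$, and its maximum, $s$; the bound is saturated only at $M_n^2 = M^2$.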

We conclude this section by noting that while the impact of the logarithms for processes with one or two $g \to b\bar b$ splittings in the initial state is similar, the correspondence between LO, NLO, ... 4F calculations and the LL, NLL, ... accuracy of calculations in 5F schemes only applies to processes with one splitting. This simple relation does not hold in the case of two splittings, as for example in $gg \to Hb\bar b$. At LO, the 4F calculation displays the leading logs $\alpha_s^2 \log^2\frac{\mathcal{Q}^2}{m_b^2}$ and part of the subleading ones, yet at NLO only one splitting at a time is correctly reproduced up to NLL, the other being effectively only LL. In other words, one would need to go up to NNLO in the 4F scheme to correctly account for all NLL terms of each splitting at that order. This is easily achieved by the 5F calculation already at NLO. In this respect it is not surprising that 5F scheme calculations of total cross sections for $b\bar b$ fusion into Higgs or $Z$ lead to very stable results under scale variations [73].

6 Conclusions 

An important set of processes featuring gluon splitting into a $b\bar b$ pair in the initial state is of phenomenological interest at the LHC. Accurate predictions for such processes can be obtained by performing calculations in either the 4F or the 5F scheme. Motivated by the somewhat unexpected result that in most cases predictions in the two schemes are found to substantially agree within uncertainties if judicious scale choices are made, in this work we have addressed the following two basic questions:



• What is the typical size of the effects of the resummation of initial-state collinear logs of the type $\log\frac{\mathcal{Q}^2}{m_b^2}$ with respect to an approximation where only logs at a finite order in perturbation theory are kept?

• What is the typical size of the $\log\frac{\mathcal{Q}^2}{m_b^2}$ themselves in phenomenologically relevant processes at hadron colliders, and in particular at the LHC?

Our main conclusion is that unless the typical Bjorken $x$ probed by the process is large, the effect of the initial-state collinear logarithms $\log\frac{\mathcal{Q}^2}{m_b^2}$ is always modest, and that even though total cross sections computed in 5-flavor schemes may indeed display a smaller uncertainty, such logarithms do not spoil the convergence of perturbation theory in 4-flavor scheme calculations.

We have identified two main and different reasons, one of dynamical and the other of kinematical nature.

The first is that the effect of the resummation of the universal $\log\frac{\mathcal{Q}^2}{m_b^2}$ terms is quite small and relevant mainly at large Bjorken $x$; in general, keeping only the explicit logs appearing at NLO is in fact an excellent approximation. This observation accounts for previously noticed, and yet not well-understood, behaviours, such as the more sizable differences between predictions in the two schemes for single top and $b\bar b \to H$ at the Tevatron than at the LHC [67, 68].

The second result of our study is that the effective scale $\mathcal{Q}$ which enters the initial-state collinear logarithm, while proportional to the hardest scale(s) in the process, turns out to be modified by universal phase space factors that tend to reduce the size of the logarithms for processes taking place at hadron colliders. In our study we have provided a simple analytic formula, valid also in the case of processes with two $b$ quarks in the initial state, that can be used to quantitatively assess the size of such logs for any process at the LHC. We have also provided a simple rule to choose the factorization scale at which to perform comparisons between calculations in the two schemes. As a result, a consistent and quantitative explanation is provided of the many examples where a substantial agreement between total cross sections obtained at NLO (and beyond) in the two schemes can be found within the expected uncertainties.

The main outcome of this study is that 4F and 5F schemes provide complementary information, which strongly motivates having calculations at higher orders available in both schemes for any given process. (Improved) 5-flavor schemes, for example, can typically provide quite accurate predictions for total rates and, being simpler, in some cases allow the calculations to be performed at NNLO, such as those already available for $b\bar b \to H, Z$ and those foreseeable in the near future, such as $t$-channel single-top production. On the other hand, since the effects of resummation are often very mild, 4-flavor calculations can also be put to use. They can be useful to achieve accurate fully exclusive predictions, such as those obtained from Monte Carlo programs at NLO accuracy. Promoting a 4-flavor calculation at NLO to an event generator is nowadays a fully automatic procedure, and kinematic effects due to the $b$ quark mass can be taken into account from the start, leading for example to a more accurate description of the kinematics of the spectator $b$ quarks in all phase space.

As we have argued, processes that can be described by two $b$ quarks in the initial state, such as $pp \to Hb\bar b$ and $pp \to Zb\bar b$, entail a simple extension of our approach. The comparison between the 5F and 4F schemes can be performed at NNLO for the former and at NLO for the latter, and results could be compared with alternative approaches, such as the proposal of Ref. [97]. We leave this to future work. In addition, given the absence of any phenomenological motivation for an intrinsic charm contribution to the proton, it would certainly be interesting to investigate to which extent the results obtained here for $b$ quarks can be applied to $c$ quarks. On a more speculative level, one could also wonder whether the approach followed in this work could be employed to develop a criterion for an optimal central factorization scale choice for processes at the LHC. We have argued that initial-state collinear logs display a universal form valid at all orders, which in fact is independent of the details of the splitting as well as of the hard scattering, and is only due to phase space. This suggests that for arbitrary processes with light partons in the initial state collinear logs are of the form $\log\frac{\mathcal{Q}^2(z)}{\mu_F^2}$ and therefore factorization scales chosen on an event-by-event basis as $\mu_F \sim \mathcal{Q}(z)$ would lead to an improvement of the perturbative expansion. Developments in this direction would certainly be welcome.
A LO matrix elements for heavy quark production

In this Appendix we give the explicit expressions for the leading-order partonic cross sections relevant for $Wb$ production and $tb$ production.

A.1 Heavy quark production (DIS and single top)

We first consider the production of a heavy quark $Q_m$ of mass $m$ in association with another heavy quark $Q_M$ of mass $M$ (with $M \ge m$). In the 4F scheme the heavy quark of mass $m$ is treated as a massive final state and does not contribute to the proton wave function. The relevant leading-order partonic subprocess is

$$g(p) + B^*(q) \to \bar{Q}_m(k_1) + Q_M(k_2). \tag{A.1}$$

Figure 14. LO Feynman diagrams associated to the $\bar{Q}_m Q_M$ production.

The results presented in this Appendix are relevant both for the calculation of electroproduction cross sections, where the virtual vector boson $B^*$ is assumed to be radiated by an incoming lepton, and for hadroproduction, where the vector boson is exchanged with a light quark. The process is depicted in Fig. 2. The amplitude in the 4F scheme is obtained from the diagrams displayed in figure 14. We define

$$t = (p-k_1)^2 = m^2 - 2p\cdot k_1,$$
$$u = (p-k_2)^2 = M^2 - 2p\cdot k_2,$$
$$s = (p+q)^2 = 2p\cdot q - Q^2.$$

For a generic coupling between the vector boson $B$ and the heavy quarks,



the matrix element M^ reads 
M^" = -iggstcu{k2) 



^ >-^i + m _, + .^. ^^-y+M ^, 



t — m^ 



u 



M2 



v{ki)ea{p)- 



(A.2) 



(A.3) 



(A.4) 



The transverse, longitudinal and axial components of the squared amplitude can be isolated 
by suitable projection operators. We define 



Mg 
Ml 

M3 



1 



g^'^M^Ml 



4Q2 



. 2Q2 



'(S + Q2)2 



p^p-'M^Ml 



(A.5) 
(A.6) 

(A.7) 
We find
Mg 



2glCF J ,^2 

tW 



id + al) 



"l"-! 



+ 2m^M^s? - 2si [m'^ui + M^'ti + (m^ + M^)niii 



+ tiui (2Q^ - 2siQ2 + ^2 ^ uf) + 2Q2 [m\i(2ti + m) + M2ti(2ni + ti)] 



+ l6gL9RmM 



si{uim^ + tiM^) + ^1*1(81 - Q^ 



Ml 



M:^ 



IQglCFQ^gl + gl) 



s\tiui 



siiuivf? + tiNf) + uiti{si - Q2) 



(A.8) 
(A.9) 



_^9lCpQ^gl-gl) 



sjtlul 



2sI{MHi - mV) + 2sim'^M'^{tl - uj) 
+ 2siQ'^{M'^t\ - m^u\) - 2siuiti{si + 2Q'^){M'^ - rr?) 
- tiui{ti - ui){2Q^ - 2Q'^si + si) 



(A.IO) 



where $t_1 = t - m^2$, $u_1 = u - M^2$ and $s_1 = s + Q^2$. The two-body invariant phase space measure is given by

$$d\Phi_2 = \frac{d^3k_1}{(2\pi)^3 2E_1}\,\frac{d^3k_2}{(2\pi)^3 2E_2}\,(2\pi)^4\,\delta^{(4)}(p+q-k_1-k_2) = \frac{1}{8\pi s_1}\,dt \tag{A.11}$$

and therefore, by definition, [98]



The partonic cross-section 



daf 
dt 


d4si OTTSl 


daf 
dt 


~ 32si Svrsi ^ 


daf 
dt 


= ' ' Ms. 

64si Svrsi 


daf 


daf , 3 daf 



(A.12) 
(A.13) 
(A.14) 



dt 



dt 2 dt 



(A.15) 



has singularities for $t \to 0$ in the limit $m \to 0$ and for $u \to 0$ in the limit $M \to 0$. The amplitudes Eqs. (A.8, A.9, A.10) are manifestly symmetric under the exchange $t_1 \leftrightarrow u_1$, $M \leftrightarrow m$.
In the case of DIS ($g_R = g_L = e_b\sqrt{4\pi\alpha_{em}}$, $M = m = m_b$), Eq. (A.15) reads

Amlsf + 2mlsl{14tiUiQ^ - s)l^Q^ - 2sitiui] 



d&o^ Us Cp vrage^ 



H 



dt Ssftjuf 

-tiUi[24QHiUi{si - Q2) + s?(2Q4 _ 2Q^si + sf - 2tiui)] 



(A.16) 



where $e_b = -1/3$ is the electric charge of the bottom quark.

In the case of single top ($g_R = 0$, $g_L = g_W/\sqrt{2}$, $M = M_t$ and $m = m_b$), Eq. (A.15) reads



da^^ as Cp gl 



dt 128sfifMf 



2s\{mlui + M/ti) - 2s\mlMl 



2\r^2 



+4(M/ii + miui) {sfQ^ - GQ^huiSi) + 2sftiui(si - 2Q'){mi + M^ 



+s|«iii(2Q^ - 2Q^si + si- 2tiui) - 2AQH{ui{si - Q 



(A.17) 



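Choosing the partonic centre-of-mass frame of the $B^* g$ system, the explicit expressions of the four-momenta are

$$p = \left(\frac{s+Q^2}{2\sqrt{s}},\, 0,\, 0,\, \frac{s+Q^2}{2\sqrt{s}}\right)$$

$$q = \left(\frac{s-Q^2}{2\sqrt{s}},\, 0,\, 0,\, -\frac{s+Q^2}{2\sqrt{s}}\right)$$

$$k_1 = \left(E_1,\, 0,\, |k|\sin\theta,\, |k|\cos\theta\right)$$

$$k_2 = \left(E_2,\, 0,\, -|k|\sin\theta,\, -|k|\cos\theta\right),$$

where $E_1$, $E_2$ and $|k|$ are fixed by energy-momentum conservation and mass-shell relations:

$$|k|^2 = \frac{\lambda(s,M^2,m^2)}{4s}, \qquad E_1 = \frac{s+m^2-M^2}{2\sqrt{s}}, \qquad E_2 = \frac{s-m^2+M^2}{2\sqrt{s}} \tag{A.18}$$

where $\lambda(a,b,c) = a^2 + b^2 + c^2 - 2ab - 2ac - 2bc$.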
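As a cross-check of Eq. (A.18), the short Python sketch below evaluates $|k|$, $E_1$ and $E_2$ and verifies energy conservation and the mass-shell conditions. The function names and the numerical values of $s$, $m$ and $M$ are illustrative assumptions, not inputs taken from the text.

```python
import math

def kallen(a, b, c):
    """Kallen function lambda(a, b, c)."""
    return a * a + b * b + c * c - 2 * a * b - 2 * a * c - 2 * b * c

def two_body(s, m2, M2):
    """Return (|k|, E1, E2) in the centre-of-mass frame for final-state
    particles with k1^2 = m2 and k2^2 = M2 at energy squared s."""
    rs = math.sqrt(s)
    k = math.sqrt(kallen(s, M2, m2)) / (2 * rs)
    e1 = (s + m2 - M2) / (2 * rs)
    e2 = (s - m2 + M2) / (2 * rs)
    return k, e1, e2

# illustrative inputs in GeV^2: a b quark and a top quark at sqrt(s) = 300 GeV
s, m2, M2 = 300.0 ** 2, 4.75 ** 2, 173.0 ** 2
k, e1, e2 = two_body(s, m2, M2)

assert abs(e1 + e2 - math.sqrt(s)) < 1e-9   # energy conservation
assert abs(e1 * e1 - k * k - m2) < 1e-6     # k1 is on its mass shell
assert abs(e2 * e2 - k * k - M2) < 1e-6     # k2 is on its mass shell
```

The mass-shell checks rely on the identity $(s+m^2-M^2)^2 - \lambda(s,M^2,m^2) = 4sm^2$, which is what makes the parametrization consistent.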

The masses of the heavy quarks act as regulators of collinear singularities, which originate from poles in $t_1, u_1$ in the squared amplitudes. Since $t_- \le t \le t_+$, with



2s V 



(A.19) 
Figure 15. LO Feynman diagrams associated to the $t$- and $s$-channel $Wb$ production.



we find 



and 



L, 



Lu 



*+ dt 



f-_ t — m? 
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u-M"^ 



log 



log 



s + m? - M"^ - A^(s,M2,m2)' 
s + M"^ -w? + \^{s,]\/P,m?) 
In the limit $m^2 \ll M^2$, relevant for single top production, we have



L, 



log 



s 


( 
1- 

V 


M 
s 



^«^l°g]i^' 



(A.22) 



whereas for $m^2 = M^2 \to 0$, as in the case of $b\bar b$ or $c\bar c$ production, we find






(A.23) 



A.2 Associated Wb production

We now consider associated $Wb$ production. In the simplified case in which only one light quark appears in the initial state, e.g. the $u$ quark, the relevant leading-order subprocesses are

$$g(p_1) + \bar u(p_2) \to \bar b(p_3) + W^-(p_4)$$
$$g(p_1) + u(p_2) \to b(p_3) + W^+(p_4) \tag{A.24}$$

and the LO Feynman diagrams are shown in figure 15. We define

$$s = (p_1+p_2)^2 = 2p_1\cdot p_2$$
$$t = (p_1-p_3)^2 = m_b^2 - 2p_1\cdot p_3$$
$$u = (p_2-p_3)^2 = m_b^2 - 2p_2\cdot p_3.$$

This process is related to the one discussed above by crossing, namely

$$t \to s, \qquad u \to t, \qquad s \to u.$$
Therefore the transverse, longitudinal and axial components of the squared matrix elements can be obtained from Eqs. (A.8, A.9, A.10) by setting $g_R = 0$, $g_L = g_W/\sqrt{2}$, $m = 0$, $M = m_b$, and by crossing the kinematic variables according to the above equation. The partonic differential cross section is given by



da 



;^4F 



OigGp 



mr — m, 



\{2s + t)-mt {2M^ + 2M^t - (s + tf) 



dt l2V2s^t-mlf 

-ml ( - 4.M^ + 2M^t - 2M^ (s^ - st + 2t^) + t{s + t)A 

-2M^t (2M^ - 2M^{s + t) + s^ + f) 



(A.25) 



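In the partonic centre-of-mass frame we can write the four-momenta of the particles as

$$p_1 = \left(\frac{\sqrt{s}}{2},\, 0,\, 0,\, \frac{\sqrt{s}}{2}\right)$$

$$p_2 = \left(\frac{\sqrt{s}}{2},\, 0,\, 0,\, -\frac{\sqrt{s}}{2}\right)$$

$$p_3 = \left(E_3,\, 0,\, |p|\sin\theta,\, |p|\cos\theta\right)$$

$$p_4 = \left(E_4,\, 0,\, -|p|\sin\theta,\, -|p|\cos\theta\right)$$

where, by energy-momentum conservation,

$$|p| = \frac{\lambda^{1/2}(s,m_b^2,M_W^2)}{2\sqrt{s}}, \qquad E_3 = \frac{s+m_b^2-M_W^2}{2\sqrt{s}}, \qquad E_4 = \frac{s+M_W^2-m_b^2}{2\sqrt{s}}. \tag{A.26}$$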



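The 4F LO partonic cross section is readily obtained by integration in the range $t_- \le t \le t_+$, with

$$t_- = m_b^2 - \frac{1}{2}\left[s + m_b^2 - M_W^2 + \lambda^{1/2}(s,m_b^2,M_W^2)\right]$$
$$t_+ = m_b^2 - \frac{1}{2}\left[s + m_b^2 - M_W^2 - \lambda^{1/2}(s,m_b^2,M_W^2)\right] \tag{A.27}$$

The dominant contribution in the collinear limit $t \to 0$, $m_b \to 0$ is therefore proportional to

$$L = \log\frac{t_- - m_b^2}{t_+ - m_b^2} \simeq \log\frac{(s-M_W^2)^2}{m_b^2\, s} \tag{A.28}$$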
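The quality of the collinear approximation in Eq. (A.28) can be gauged numerically. The sketch below, written in Python under the assumption that $t_\pm = m_b^2 - \frac{1}{2}[s + m_b^2 - M_W^2 \mp \lambda^{1/2}(s,m_b^2,M_W^2)]$, compares the exact logarithm with its $m_b \to 0$ limit. The mass values and the partonic energy are illustrative inputs, not numbers taken from the text.

```python
import math

def kallen(a, b, c):
    """Kallen function lambda(a, b, c)."""
    return a * a + b * b + c * c - 2 * a * b - 2 * a * c - 2 * b * c

def t_limits(s, mb2, mw2):
    """Kinematic limits (t_minus, t_plus) of t = (p1 - p3)^2."""
    lam = math.sqrt(kallen(s, mb2, mw2))
    base = s + mb2 - mw2
    return mb2 - 0.5 * (base + lam), mb2 - 0.5 * (base - lam)

def exact_log(s, mb2, mw2):
    """log[(t_minus - mb^2) / (t_plus - mb^2)] without approximations."""
    t_minus, t_plus = t_limits(s, mb2, mw2)
    return math.log((t_minus - mb2) / (t_plus - mb2))

def collinear_log(s, mb2, mw2):
    """Leading behaviour of the same logarithm for mb -> 0."""
    return math.log((s - mw2) ** 2 / (mb2 * s))

# illustrative inputs in GeV^2
s, mb2, mw2 = 200.0 ** 2, 4.75 ** 2, 80.4 ** 2
difference = exact_log(s, mb2, mw2) - collinear_log(s, mb2, mw2)
print(f"exact - collinear = {difference:.2e}")
```

For inputs of this kind the two expressions differ by terms suppressed by powers of $m_b^2/s$, so the difference should be small compared with the logarithm itself.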
B Higher orders

In Section 5.3 we have considered the general class of processes

$$I(q) + g(p) \to b(k) + g(k_1) + \ldots + g(k_n) + X(P), \tag{B.1}$$

where $X$ is a generic heavy system, such that $P^2 \ge M^2$. By using a simple argument we have shown that the transverse momentum of each of the $n$ gluons in the final state is bounded from above by the same quantity, and we therefore expect a factor of

$$\log\frac{\mathcal{Q}^2(z)}{m_b^2} \tag{B.2}$$

for each emitted gluon.

This simple argument has the disadvantage that it does not provide us with the exact expression of the cross section in the collinear limit. The argument can be made more precise by studying the explicit expression of the squared amplitude and of the phase space measure for multiparton emission in the collinear limit. In order to simplify our discussion, we specialize to the case when $X$ is a real heavy particle, such as a top quark or a $W$ boson.

The dominant Feynman diagrams in the collinear limit are shown in Fig. 13.

It will therefore be convenient to express both the squared amplitude and the phase space measure in terms of the variables

$$t = (p-k)^2, \qquad t_1 = (p-k-k_1)^2, \qquad \ldots, \qquad t_n = (p-k-k_1-\ldots-k_n)^2 \tag{B.3}$$

because the leading contribution to the squared amplitude in the collinear limit will be proportional to

$$\frac{1}{t-m_b^2}\,\frac{1}{t_1-m_b^2}\cdots\frac{1}{t_n-m_b^2}. \tag{B.4}$$

The $(n+2)$-body phase space measure can be worked out by the usual factorization technique [99]. We obtain

$$d\Phi_{n+2}(p,q;k,k_1,\ldots,k_n,P) = \int\frac{dM_n^2}{2\pi}\, d\Phi_2(p,q;k,P_n)\, d\Phi_{n+1}(P_n;k_1,\ldots,k_n,P), \tag{B.5}$$

where

$$P_n = k_1 + \ldots + k_n + P; \qquad P_n^2 = M_n^2. \tag{B.6}$$

In the reference frame defined by Eqs. (5.32) and (5.33) we have

$$d\Phi_2(p,q;k,P_n) = \frac{|k|}{16\pi^2\sqrt{s}}\, d\phi\, d\cos\theta, \tag{B.7}$$
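where $\theta, \phi$ are the polar and azimuthal angles of $k$, and

$$|k|^2 = \frac{\lambda(s,m_b^2,M_n^2)}{4s}. \tag{B.8}$$

Because

$$t = (p-k)^2 = m_b^2 - \frac{s+Q^2}{\sqrt{s}}\left(\sqrt{m_b^2+|k|^2} - |k|\cos\theta\right) \tag{B.9}$$

we may recast Eq. (B.7) in the Lorentz-invariant form

$$d\Phi_2(p,q;k,P_n) = \frac{d\phi\, dt}{16\pi^2\,(s+Q^2)}. \tag{B.10}$$

The kinematic bounds on $t$ are easily read off Eqs. (B.8, B.9):

$$t^- \le t \le t^+, \qquad t^\pm = m_b^2 - \frac{s+Q^2}{2s}\left[s + m_b^2 - M_n^2 \mp \lambda^{1/2}(s,m_b^2,M_n^2)\right]. \tag{B.11}$$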



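The factorization procedure can be iterated until the whole phase space is expressed as a product of two-body phase space measures. Let us take one more step explicitly:

$$d\Phi_{n+1}(P_n;k_1,\ldots,k_n,P) = \int\frac{dM_{n-1}^2}{2\pi}\, d\Phi_2(P_n;k_1,P_{n-1})\, d\Phi_n(P_{n-1};k_2,\ldots,k_n,P), \tag{B.12}$$

where

$$P_{n-1} = k_2 + \ldots + k_n + P; \qquad P_{n-1}^2 = M_{n-1}^2. \tag{B.13}$$

Choosing

$$p - k = \left(\sqrt{t+\omega^2},\, 0,\, 0,\, \omega\right) \tag{B.14}$$

$$q = \left(\sqrt{q^2+\omega^2},\, 0,\, 0,\, -\omega\right) \tag{B.15}$$

the condition $(p-k+q)^2 = M_n^2$ gives

$$\omega = \frac{\lambda^{1/2}(M_n^2,t,q^2)}{2M_n}. \tag{B.16}$$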

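A straightforward calculation yields

$$t_1 = (p-k-k_1)^2 = t - \frac{M_n^2-M_{n-1}^2}{2M_n^2}\left[M_n^2 + t - q^2 - \lambda^{1/2}(M_n^2,t,q^2)\cos\theta_1\right] \tag{B.17}$$

so that

$$d\Phi_2(P_n;k_1,P_{n-1}) = \frac{d\phi_1\, dt_1}{16\pi^2\,\lambda^{1/2}(M_n^2,t,q^2)} \tag{B.18}$$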
with

$$t_1^- \le t_1 \le t_1^+, \qquad t_1^\pm = t - \frac{M_n^2-M_{n-1}^2}{2M_n^2}\left[M_n^2 + t - q^2 \mp \lambda^{1/2}(M_n^2,t,q^2)\right]. \tag{B.19}$$
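Equations (B.16) and (B.17) can be verified by building the four-vectors explicitly in the rest frame of $P_n$. The following Python sketch does so under the assumptions that the emitted gluon is massless, so that its energy in that frame is $(M_n^2-M_{n-1}^2)/(2M_n)$, and that $t$ and $q^2$ are both negative. All function names and numerical inputs are hypothetical.

```python
import math

def kallen(a, b, c):
    """Kallen function lambda(a, b, c)."""
    return a * a + b * b + c * c - 2 * a * b - 2 * a * c - 2 * b * c

def minkowski_sq(v):
    """Invariant square of a four-vector (E, x, y, z)."""
    e, x, y, z = v
    return e * e - x * x - y * y - z * z

def t1_formula(t, q2, mn2, mnm1_2, cos_th):
    """t_1 as a function of cos(theta_1), in the closed form of Eq. (B.17)."""
    lam = math.sqrt(kallen(mn2, t, q2))
    return t - (mn2 - mnm1_2) / (2 * mn2) * (mn2 + t - q2 - lam * cos_th)

def t1_from_vectors(t, q2, mn2, mnm1_2, cos_th):
    """t_1 = (p - k - k_1)^2 from explicit four-vectors in the P_n rest frame."""
    mn = math.sqrt(mn2)
    w = math.sqrt(kallen(mn2, t, q2)) / (2 * mn)   # |p - k|, as in Eq. (B.16)
    pk = (math.sqrt(t + w * w), 0.0, 0.0, w)       # p - k, with (p - k)^2 = t
    w1 = (mn2 - mnm1_2) / (2 * mn)                 # energy of the massless gluon
    sin_th = math.sqrt(1.0 - cos_th * cos_th)
    k1 = (w1, 0.0, w1 * sin_th, w1 * cos_th)
    return minkowski_sq(tuple(a - b for a, b in zip(pk, k1)))

# illustrative inputs in GeV^2
t, q2, mn2, mnm1_2 = -500.0, -2500.0, 200.0 ** 2, 180.0 ** 2
for c in (-1.0, -0.3, 0.0, 0.7, 1.0):
    assert abs(t1_formula(t, q2, mn2, mnm1_2, c)
               - t1_from_vectors(t, q2, mn2, mnm1_2, c)) < 1e-6
```

Setting $\cos\theta_1 = \pm 1$ in `t1_formula` gives the two endpoints $t_1^\pm$ of the integration range.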


Thus

d^n+2{p,q;k,ki,. 


.,K,P) = 










(""-dMl^, 1 rfti ,, ,^ rP ' 

/ o ^^ ^2 1 o n #ia^n(-Pn-l K2,. 
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..,A;„,P) 
The procedure can be iterated, to obtain
d^n+2{v,(i;k,ki, . . . ,kn,P) 



M2 



(27r)" (47r)2'^+2 
dMl I dMl_^... / dMl 



A/2 
A/2 



M2 



di 



t+ 



din 



dt„- 



s + Q^Jt- \-2{Mltn-i,q')Jt^ A2 (Ml, t„„2,<?2) A- A2(M2,t,g2 






(B.21) 



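with

$$t_j^- \le t_j \le t_j^+, \qquad t_j^\pm = t_{j-1} - \frac{M_{n-j+1}^2 - M_{n-j}^2}{2M_{n-j+1}^2}\left[M_{n-j+1}^2 + t_{j-1} - q^2 \mp \lambda^{1/2}(M_{n-j+1}^2,t_{j-1},q^2)\right] \tag{B.22}$$

for $1 \le j \le n$, where we understand that $t_0 \equiv t$, $M_0 \equiv M$.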

Our next task is evaluating the most singular term in the cross section, $\sigma^{\rm coll}$, in the limit $m_b \to 0$. This is obtained by integrating the squared amplitude, Eq. (B.4), using the integration measure Eq. (B.21). Let us consider the case $n = 1$ as an example. In this case



,coll 



A/2 
(v^-mb)2 

A/2 



dMf 



dMl 



t+ 



dt- 



1 



t- t-ml x\ {Mf,t, g2) Jt- h - ml 



dU 



dt 



t-mlx-2{Mlt,q'^) 



log-i 



tT - mi 



U - mi 



(B.23) 
where

$$t_1^\pm = t - \frac{M_1^2-M^2}{2M_1^2}\left[M_1^2 + t - q^2 \mp \lambda^{1/2}(M_1^2,t,q^2)\right] \tag{B.24}$$

$$t^\pm = m_b^2 - \frac{s+Q^2}{2s}\left[s + m_b^2 - M_1^2 \mp \lambda^{1/2}(s,m_b^2,M_1^2)\right] \tag{B.25}$$

and we have omitted an overall proportionality constant which takes into account, among other factors, the result of azimuthal integrations. The collinear singularity occurs at the upper bound of integration, $t^+$, which vanishes as $m_b \to 0$, as can be seen expanding $t^\pm$ Eq. (B.25) in powers of $m_b^2$ to the first non-trivial order:



{s + Q^){s-Ml) 
s 



+ 0{m\ 



We now note that both A ^/^(M^,t, g^) and t^ Eq. (B.24) are regular in i = i"* 
can therefore be computed at t = t^. Indeed 



{M( - M'){M{ + Q') 



+ 0(t) 



(B.26) 
(B.27) 

0, and 

(B.28) 
(B.29) 



In order to set an upper bound for the most singular contribution to $\sigma^{\rm coll}$, Eq. (B.23), we neglect the $O(t)$ and $O(t^2)$ contributions in Eqs. (B.28)-(B.29), and note that the limits of integration in $t_1$ are bounded by



*r> 



tj <t~ 



{s-M^){s + Q^) 



NP + Q^ 
Ml + Q2 



-mu 



^M^ + Q^ ^ ^M^ + Q^ 



Ml 



< -mi 



M2 ■ 



(B.30) 
(B.31) 



The first inequality is a consequence of the fact that $t_1^-$ is an increasing function of $M_1^2$, and $M_1^2 \le s$. The second inequality follows from $t \le t^+$ and $M_1^2 \ge M^2$. In so doing, the integrations in $t_1$ and $t$ become independent. The integration in $t_1$ yields



log- 



t; 



mr 



< log 



(s-M' 



2n2 



snir 



log 



Q\z) 



mr 



(B.32) 



The integration in $t$, up to less singular terms, gives rise to a logarithm which is also bounded by $\log \mathcal{Q}^2(z)/m_b^2$:



log 



t+ — m. 



,-log<-f'%log'»-..f>^logg!M 



sm,i 



sm^t 



mr 



(B.33) 
Figure 16. $gg \to Z/H\,b\bar b$ production.



since $M_1^2 \ge M^2$. Thus,



^coU < 



M2 Af^ + Q 



2 _L n2 ^"^ 



mt 



(B.34) 



This proves that the argument of the collinear logarithms is bounded from above by $\mathcal{Q}^2(z)/m_b^2$ even in the presence of one extra emission. The proof can be easily extended to the case
of n extra gluon emissions, just noting that the integrations over Af? in Eq. B.21 can 
disentangled by using M^ as the upper limit of integration for all of them, neglecting tj in 
the A2(M? tfc,g^) denominators, and then recursively building lower and upper bounds to 
the tJ limits. 



The above proof can also be used with minor modifications to show that 



\z) 



are 



universal kinematic terms associated to the collinear logs in processes that can be described via $b$-PDFs for both initial legs, such as $Hb\bar b$ and $Zb\bar b$ production displayed in figure 16. In this case one can write:



O"coll oc 



(v^-mj,)2 ^^^ r-tl 



{m^+MY ■Si Jt:^ *2 - ml Jt- h - rri] 



jdti 



(B.35) 



with 



si = {Pb+PMf 

h = (Pi - Pbf 

t2 = {P2 - Pbf 
± 2 1 Si - ^2 

2 Si 



t 



ml 



t 



ml 



[si - KP + ml) ± A2 {si,M'^,ml 



s - si + ml ± \2{s,ml,si) 



(B.36) 
(B.37) 
(B.38) 

(B.39) 
(B.40) 
Integration over $t_1$ directly leads to



-^ 1 ~ log ^ 

t{ - mf sim( 



Ti — TILZ I Si — IVl ) ,„ 

log:^ |=^log^^ ^ (B.41) 



while integration over t2 gives 



tZ — mf smf 



leading to 



0-collOC / log 2 log 2— (^•'^3) 

y(„,+M)2 Si sim^ smf 

It is easy to see that the $s_1 \sim (\sqrt{s}-m_b)^2$ and $s_1 \sim (m_b+M)^2$ integration regions each give
rise to single logs of the form log — ^ . The single logs correspond to the case where only 
one of the two $b$'s is collinear to the initial state while the other takes part in the genuine hard scattering (in the 5F language this configuration is described by $gb \to bH, bZ$). The double log (i.e., the dominant contribution that in the 5F approach leads to the leading order process $b\bar b \to Z, H$) is obtained from the intermediate region of integration for $s_1$, where $s_1 \sim s_2$ and $s \gg s_1$ and $s_1 \gg M^2$, a region that is always kinematically accessible due to the identity

(B.44) 



In such a region 
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-- Sbb - Si 


- S2 + ml + 


ml + M\ 
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- M2)2 

siml 


log 2 

smf 


ml 
C Explicit NLO expression for $\tilde{b}$

The coefficients $a_j^{(p)}$ (with $j = \Sigma, g$ and $p = 1, 2$), up to constant terms (that we set to zero in order to fulfil the boundary condition $b(x, m_b^2) = 0$) are given by [49]

«?^f^'4) =2P,,(z)log-4 (C.l) 
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(2) / ^J■- 



[cfTr [8P,<,(z)log(l - z) - (2 - 4z + 8z^)\ogz- (1 - 4z) 



-CaTr 



Q ^2 

8Pgg(z) log(l - z) + (4 + 16z) logz + — + 2 + 16z - — z 
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16 



(z^ + a-zf) 



lO] 



2 M 
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8P,<,(z)[21ogzlog(l - z) - log2(l - z) + 2C(2)] 
(2 - 4z + Sz^) log^ z - 16z(l - z) log(l - z) 



-(6 - 8z + 16z2) log z - 28 + 58z - 40z' 



+CaTr 



8P,g(z)[Li2(-z) + logzlog(l + z)] + (8 - 16z + IGz^) log^(l - z) 
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+ (4 + 8z) log^ z + 16zC(2) + 16z(l - z) log(l - z) - 4 + 32z + — -z^ log z 



- + 16-100Z+— z^ 
9z 9 



for the gluon initiated splitting and by 
(C.3)


J "^6 



for the quark-initiated splitting. In the above expression we used the Altarelli-Parisi splitting function $P_{qg}$ defined as

$$P_{qg}(z) = T_R\left[z^2 + (1-z)^2\right], \qquad T_R = \frac{1}{2}. \tag{C.4}$$